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Enterococcus faecalis polvnucleotides and polypeptides 

Field of the Invention 

The present invention relates to novel Enterococcus faecalis genes {E. faecalis) 
5 nucleic acids and polypeptides. Also provided are vectors, host cells and recombinant 
methods for producing the same. Further provided are diagnostic methods for 
detecting Enterococcus faecalis using probes, primers, and antibodies to the E, faecalis 
nucleic acids and polypeptides of the present invention. The invention further relates 
to screening methods for identifying agonists and antagonists ofE, faecalis 
10 polypeptide activity and to vaccines using E, faecalis nucleic acids and polypeptides. 

Background of the Invention 

Enterococci have been recognized as being pathogenic for humans since the 
turn of the century when they were first described by Thiercelin in 1988 as 

15 microscopic organisms. The genus Enterococcus includes the species Enterococcus 
faecalis or E. faecalis which is the most common pathogen in the group, accounting for 
80 - 90 percent of all enterococcal infections. See Lewis et al, (1990) Eur J. Clin 
Microbiol Infect Dis.9:l 11-117. 

The incidence of enterococcal infections has increased in recent years and 

20 enterococci are now the second most frequently reported nosocomial pathogens. 

Enterococcal infection is of particular concern because of its resistance to antibiotics. 
Recent attention has focused on enterococci not only because of their increasing role in 
nosocomial infections, but also because of their remarkable and increasing resistance to 
antimicrobial agents. These factors are mutually reinforcing since resistance allows 

25 enterococci to survive in an environment in which antimicrobial agents are heavily 
used; the hospital setting provides the antibiotics which eluninate or suppress 
susceptible bacteria, thereby providing a selective advantage for resistant organisms, 
and the hospital also provides the potential for dissemination of resistant enterococci 
via the usual routes of hand and environmental contamination. 
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Antimicrobial resistance can be divided into two general types, inherent or 
intrinsic property and that which is acquired. The genes for intrinsic resistance, like 
other species characteristics, appear to reside on the chromosome. Acquired 
resistance results from either a mutation in the existing DNA or acquisition of new 

5 DNA. The various inherent traits expressed by enterococci include resistance to 
semisynthetic penicillinase-resistant penicillins, cephalosporins, low levels of 
aminoglycosides, and low levels of clindamycin. Examples of acquired resistance 
include resistance to chloramphenicol, erythromycin, high levels of clindamycin, 
tetracycline, high levels of aminoglycosides, penicillin by means of penicillinase, 

10 fluoroquinolones, and vancomycin. Resistance to high levels of penicillin without 
penicillinase and resistance to fluoroquinolones are not known to be plasmid or 
transposon mediated and presumably are due to mutation(s). 

Although the main reservoir for enterococci in humans is the gastrointestinal 
tract, the bacteria can also reside in the gallbladder, urethra and vagina. 

1 5 E, faecalis has emerged as an important pathogen in endocarditis, bacteremia, 

urinary tract infections (UTIs), intraabdominal infections, soft tissue infections, and 
neonatal sepsis. See Lewis et al. (1990) ^upra.. In the 1970s and 1980s enterococci 
became firmly established as major nosocomial pathogens. They are now the fourth 
leading cause of hospital-acquired infection and the third leading cause of bacteremia in 

20 the United States. Fatality ratios for enterococcal bactermia range from 12% to 68%, 
with death due to enterococcal sepsis in 4 to 50% of these cases. See T.G. Emori 
(1993) Clin. Microbiol. Rev. 6:428-442. 

The ability of enterococci to colonize the gastrointestinal tract, plus the many 
intrinsic and acquired resistance traits, means that these organisms, which usually 

25 seem to have relatively low intrinsic virulence, are given an excellent opportunity to 
become secondary invaders. Since nosocomial isolates of enterococci have displayed 
resistance to essentially every useful antimicrobial agent, it will likely become 
increasingly difficult to successfully treat and control enterococcal infections. 
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Particularly when the various resistance genes come together in a single strain, an 
event almost certain to occur at some time in the future. 

The etiology of diseases mediated or exacerbated by Enterococcus faecalis, 
involves the programmed expression of E. faecalis genes, and that characterizing these 
5 genes and their patterns of expression would dramatically add to our understanding of 
the organism and its host interactions. Knowledge of the E, faecalis gene and genomic 
organization would improve our understanding of disease etiology and lead to 
improved and new ways of preventing, treating and diagnosing diseases. Thus, there 
is a need to characterize the genome of E. faecalis and for polynucleotides of this 
10 organism. 

Summary of the Invention 

The present invention provides for isolated faecalis polynucleotides and 
polypeptides shown in Table 1 and SEQ ID NO: 1 through SEQ ID NO:496 

15 (polynucleotide sequences having odd SEQ ID NOs and polypeptide sequences 

having even SEQ ID NOs). One aspect of the invention provides isolated nucleic acid 
molecules comprising polynucleotides having a nucleotide sequence selected from the 
group consisting of: (a) a nucleotide sequence shown in Table 1 ; (b) a nucleotide 
sequence encoding any of the amino acid sequences of the polypeptides shovm in 

20 Table 1 ; and (c) a nucleotide sequence complementary to any of the nucleotide 

sequences in (a) or (b). The invention further provides for fragments of the nucleic 
acid molecules of (a), (b) & (c) above. 

Further embodiments of the invention include isolated nucleic acid molecules 
that comprise a polynucleotide having a nucleotide sequence at least 90% identical, 

25 and more preferably at least 95%, 96%, 97%, 98% or 99% identical, to any of the 
nucleotide sequences in (a), (b) or (c) above, or a polynucleotide which hybridizes 
under stringent hybridization conditions to a polynucleotide in (a), (b) or (c) above. 
Additional nucleic acid embodiments of the invention relate to isolated nucleic acid 
molecules comprising polynucleotides which encode the amino acid sequences of 
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epitope-bearing portions of a E, faecalis polypeptide having an amino acid sequence in 
(a) above. 

The present invention also relates to recombinant vectors, which include the 
isolated nucleic acid molecules of the present invention, and to host cells containing 
5 the recombinant vectors, as well as to methods of making such vectors and host cells. 
The present invention further relates to the use of these vectors in the production of 
E, faecalis polypeptides or peptides by recombinant techniques. 

The invention further provides isolated E. faecalis polypeptides having an 
amino acid sequence selected from the group consisting of an amino acid sequence of 
10 any of the polypeptides described in Table 1 or fragments thereof. 

The polypeptides of the present invention also include polypeptides having 
an amino acid sequence with at least 70% similarity, and more preferably at least 75%, 
80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% similarity to those described in Table 
1, as well as polypeptides having an amino acid sequence at least 70% identical, more 
15 preferably at least 75% identical, and still more preferably 80%, 85%, 90%, 95%, 
96%, 97%, 98%, or 99% identical to those above; as well as isolated nucleic acid 
molecules encoding such polypeptides. 

The present invention further provides a single or multi-component vaccine 
comprising one or more of the E. faecalis polynucleotides or polypeptides described 
20 in Table 1 , or fragments thereof, together with a pharmaceutically acceptable diluent, 
carrier, or excipient, wherein the E, faecalis polypeptide(s) are present in an amount 
effective to elicit an immune response to members of the Enterococcus genus, or at 
least E. faecalis , in an animal. The E, faecalis polypeptides of the present invention 
may further be combined v/ith one or more immunogens of one or more other 
25 Enterococcal or non-Enterococcal organisms to produce a multi-component vaccine 
intended to elicit an immunological response against members of the Enterococcus 
genus and, optionally, one or more non-Enterococcal organisms. 

The vaccines of the present invention can be administered in a DNA form, e.g., 
"naked" DNA, wherein the DNA encodes one or more Enterococcal polypeptides 
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and, optionally, one or more polypeptides of a non-Enterococcal organism. The DNA 
encoding one or more polypeptides may be constructed such that these polypeptides 
are expressed as fusion proteins. 

The vaccines of the present invention may also be administered as a 
5 component of a genetically engineered organism or host cell. Thus, a genetically 

engineered organism or host cell which expresses one or more E.faecalis polypeptides 
may be administered to an animal. For example, such a genetically engineered 
organism or host cell may contain one or more E. faecalis polypeptides of the present 
invention intracellularly, on its cell surface, or in its periplasmic space. Further, such 
1 0 a genetically engineered organism or host cell may secrete one or more E. faecalis 

polypeptides. The vaccines of the present invention may also be co-administered to 
an animal with an immune system modulator (e.g., CD86 and GM-CSF). 

The invention also provides a method of inducing an immunological response 
in an animal to one or more members of the Enterococcus genus, preferably one or 
15 more isolates of the E, faecalis species, comprising administering to the animal a 
vaccine as described above. 

The invention further provides a method of inducing a protective immime 
response in an animal, sufficient to prevent, attenuate, or control an infection by 
members of the Enterococcus genus, preferably at least E. faecalis species, 
20 comprising administering to the animal a composition comprising one or more of the 
polynucleotides or polypeptides described in Table 1, or fragments thereof. Further, 
these polypeptides, or fragments thereof, may be conjugated to another immunogen 
and/or administered in admixture with an adjuvant. 

The invention further relates to antibodies elicited in an animal by the 
25 administration of one or more E. faecalis polypeptides of the present invention and to 
methods for producing such antibodies and fragments thereof The invention further 
relates to recombinant antibodies and fragments thereof and to methods for producing 
such antibodies and fragments thereof 

The invention also provides diagnostic methods for detecting the expression of 
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the polynucleotides of Table 1 by members of the Enterococcus genus in an animal. 
One such method involves assaying for the expression of a polynucleotide encoding 
E.faecalis polypeptides in a sample from an animal. This expression may be assayed 
either directly (e.g., by assaying polypeptide levels using antibodies elicited in 
5 response to amino acid sequences described in Table 1) or indirectly (e.g., by assaying 
for antibodies having specificity for amino acid sequences described in Table 1). The 
expression of polynucleotides can also be assayed by detecting the nucleic acids of 
Table 1 . An example of such a method involves the use of the polymerase chain 
reaction (PGR) to amplify and detect Enterococcus nucleic acid sequences. 

10 The present invention also relates to nucleic acid probes having all or part of a 

nucleotide sequence described in Table 1 (odd SEQ ID NOs) which are capable of 
hybridizing under stringent conditions to Enterococcus nucleic acids. The invention 
further relates to a method of detecting one or more Enterococcus nucleic acids in a 
biological sample obtained from an animal, said one or more nucleic acids encoding 

15 Enterococcus polypeptides, comprising: (a) contacting the sample with one or more 
of the above-described nucleic acid probes, under conditions such that hybridization 
occurs, and (b) detecting hybridization of said one or more probes to the Enterococcus 
nucleic acid present in the biological sample. 

Other uses of the polypeptides of the present invention include: inter alia, to 

2n detect E, fapraiig in immunoassays, as epitope tags, as molecular weight markers on 
SDS-PAGE gels, as molecular weight maricers for molecular sieve gel filtration 
columns, to generate antibodies that specificaly bind E.faecalis polypeotides of the 
present invention for the detection E.faecalis in immunoassays, to generate an 
immune response against E, faecalis and other Enterococcus species, and as vaccines 

25 against £. faecalis, other Enterococcus species and other bacteria genuses. 

Isolated nucleic acid molecules of the present invention, particularly DNA 
molecules, are useful as probes for gene mapping and for identifying E.faecalis in a 
biological samples, for instance, by Southern and Northern blot analysis. 
Polynucleotides of the present invention are also useful in detecting E.faecalis by 
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PCR using primers for a particular E, faecalis polynucleotide. Isolated 
polynucleotides of the present invention are also useful in making the polypeptides of 
the present invention. 

5 Detailed Description 

The present invention relates to recombinant E. faecalis nucleic acids and 
fragments thereof. The present invention further relates to recombinant £. faecalis 
polypeptides and fragments thereof. The invention also relates to methods for using 
these polypeptides to produce immunological responses and to confer immunological 

10 protection to disease caused by members of the genus Enterococcus, at least isolates 
of the E, faecalis genus. The invention further relates to nucleic acid sequences which 
encode antigenic E. faecalis polypeptides and to methods for detecting E, faecalis 
nucleic acids and polypeptides in biological samples. The invention also relates to 
antibodies specific for the polypeptides and peptides of the present invention and 

15 methods for detecting such antibodies produced in a host animal. 



Definitions 

The following definitions are provided to clarify the subject matter which the 
inventors consider to be the present invention. 
20 As used herein, the phrase "pathogenic agent" means an agent which causes a 

disease state or affliction in an animal. Included within this definition, for examples, 
are bacteria, protozoans, fimgi, viruses and metazoan parasites which either produce a 
disease state or render an animal infected with such an organism susceptible to a 
disease state (e.g., a secondary infection). Further included are species and strains of 
25 the genus Enterococcus which produce disease states in animals. 

As used herein, the term "organism" means any living biological system, 
including viruses, regardless of whether it is a pathogenic agent. 

As used herein, the term "Enterococcus" means any species or strain of 
bacteria which is members of the genus Enterococcus. Such species and strains are 
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known to those of skill in the art, and include those that are pathogenic and those that 
are not. . 

As used herein, the phrase "one or more E.faecalis polypeptides of the 
present invention" means polypeptides comprising the amino acid sequence of one or 

5 more of the E.faecalis polypeptides described in Table 1 (even SEQ ID NOs). These 
polypeptides may be expressed as fusion proteins wherein the E, faecalis 
polypeptides of the present invention are linked to additional amino acid sequences 
which may be of Enterococcal or non-Enterococcal origin. This phrase further 
includes polypeptide comprising fragments of the E.faecalis polypeptides of the 

10 present invention. Additional definitions are provided throughout the specification. 

Explanation of Table 1 

Table 1 , below, provides information describing genes which encode 
polypeptides of E.faecalis, The table lists the gene identifier which consists of the 

15 letters EF, which denote E.faecalis, followed inunediately by a three digit numeric 
code, which arbitrarily number the E.faecalis genes of the present invention. A 
number fi-om 1 through 4 follows the three digit number. A nimiber 1 represents the 
ftill length open reading frame of the gene specified by the proceeding three digit 
number. A number 2 represents the full leng+h polypeptide encoded by the gene 

20 specified the preceeding three digit number. A number 3 represents a polynucleotide 
fragment, of the gene represented by the preceeding three digit number, used to 
produce an antigenic polypeptide. A number 4 represents an antigenic polypeptide 
fragment, of the gene represented by the preceeding three digit number, used to 
stimulate an immune response or as a vaccine. The nucleotide and amino acid 

25 sequences of each gene and fragment are also shown in the Sequence Listing under the 
SEQ ID NO listed in Table 1. 

Explanation of Table 2 

Table 2 lists accession numbers for the closest matching sequences between 
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the polypeptides of the present invention and those available through GenBank and 
Derwent databases. These reference numbers are the database entry numbers 
commonly used by those of skill in the art, who will be familar with their 
denominations. The descriptions of the numenclature for GenBank are available from 
5 the National Center for Biotechnology Infomiation. Column 1 lists the gene or ORF 
of the present invention. Column 2 lists the accession number of a "match" gene 
sequence in GenBank or Derwent databases. Column 3 lists the description of the 
"match" gene sequence. Columns 4 and 5 are the high score and smallest sum 
probability, respectively, calculated by BLAST. Polypeptides of the present 
10 invention that do not share significant identity/similarity with any polypeptide 

sequences of GenBank and Derwent are not represented in Table 2. Polypeptides of 
the present invention that share significant identity/similarity with more than one of 
the polypeptides of GenBank and Derwent are represented more than once. 

15 Explanation of Table 3. 

The E. faecalis polypeptides of the present invention may include one or more 
conservative amino acid substitutions from natural mutations or human manipulation 
as indicated in Table 3. Changes are preferably of a minor nature, such as conservative 
amino acid substitutions that do not significantly affect the folding or activity of the 

20 protein. Residues from the following groups, as indicated in Table 3, may be 

substituted for one another: Aromatic, Hydrophobic, Polar, Basic, Acidic, and Small, 

Explanation of Table 4 

Table 4 lists residues comprising antigenic epitopes of antigenic epitope- 
25 bearing fragments present in each of the full length E, faecalis polypeptides described 
in Table 1 as predicted by the inventors using the algorithm of Jameson and Wolf, 
(1988) Comp. Appl. Biosci. 4:181-186. The Jameson-Wolf antigenic analysis was 
performed using the computer program PROTEAN (Version 3. 1 1 for the Power 
Macintosh, DNASTAR, Inc., 1228 South Park Street Madison, Wl). E. faecalis 
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polypeptide shown in Table 1 may one or more antigenic epitopes comprising 
residues described in Table 4. It will be appreciated that depending on the analytical 
criteria used to predict antigenic determinants, the exact address of the determinant 
may vary slightly. The residues and locations shown described in Table 4 correspond 
5 to the amino acid sequences for each lull length gene sequence shown in Table 1 and in 
the Sequence Listing. Polypeptides of the present invention that do not have 
antigenic epitopes recognized by the Jameson-Wolf algorithm are not represented in 
Table 2. 

1 0 Selection of Nucleic Acid Sequences Encoding Antigenic E, faecalis Polypeptides 

Sequenced E, faecalis genomic DNA was obtained from the E. faecalis strain 
V586, The E. faecalis strain V586 was deposited 2 May 1997 at the ATCC, 10801 
University Blvd. Manassas, VA 201 10-2209, and given accession number 55969. 
Some ORFs contained in the subset of fragments of the E, faecalis genome 

15 disclosed herein were derived through the use of a number of screening criteria detailed 
below. The ORFs are boxmded at the amino terminus by a methionine or valine 
residue and usually at the carboxy terminus by a stop codon. 

Most of the selected sequences consist of complete ORFs. The polypeptides 
that do not comprise a complete ORF can be determined by determining whether the 

20 corresponding polynucleotide sequence comprises a stop codon after the codon for 
the last amino acid residue in the polypeptide sequence. It is not always preferred to 
express a complete ORF in a heterologous system. It may be challenging to express 
and purify a highly hydrophobic protein by common laboratory methods. Some of 
the polypeptide vaccine candidates described herein have been modified slightly to 

25 simplify the production of recombinant protein. For example, nucleotide sequences 
which encode highly hydrophobic domains, such as those found at the amino terminal 
signal sequence, have been excluded from some constructs used for expression of the 
polypeptides. Furthermore, any highly hydrophobic amino acid sequences occurring 
at the caiboxy terminus have also been excluded fix)m the recombinant expression 
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constructs. Thus, in one embodiment, a polypeptide which represents a truncated or 
modified ORF may be used as an antigen. 

While numerous methods are known in the art for selecting potentially 
immunogenic polypeptides, many of the ORFs disclosed herein were selected on the 
5 basis of screening Enterococcus faecalis ORFs for several aspects of potential 
iramunogenicity. One set of selection criteria are as follows: 

1 . Type I signal sequence: An amino terminal type I signal sequence generally 
directs a nascent protein across the plasma and outer membranes to the exterior of the 
bacterial cell. Experimental evidence obtained from studies with Escherichia coli 

10 suggests that the typical type I signal sequence consists of the following biochemical 
and physical attributes (Izard, J. W. and Kendall, D, A. Mol Microbiol 13:765-772 
(1994)). The length of the type I signal sequence is approximately 15 to 25 primarily 
hydrophobic amino acid residues with a net positive charge in the extreme amino 
terminus. In addition, the central region of the signal sequence adopts an alpha-helical 

15 conformation in a hydrophobic environment. Finally, the region surrounding the 

actual site of cleavage is ideally six residues long, with small side-chain amino acids in 
the -1 and -3 positions. 

2. Type IV signal sequence: The type IV signal sequence is an example of the 
several types of functional signal sequences which exist in addition to the type I signal 

20 sequence detailed above. Although functionally related, the type IV signal sequence 
possesses a imique set of biochemical and physical attributes (Strom, M. S. and Lory, 
S., J. Bacteriol J 7^:7345-735 1 (1992)). These are typically six to eight amino acids 
with a net basic charge followed by an additional sixteen to thirty primarily 
hydrophobic residues. The cleavage site of a type IV signal sequence is typically after 

25 the initial six to eight amino acids at the extreme amino terminus. In addition, type IV 
signal sequences generally contain a phenylalanine residue at the +1 site relative to the 
cleavage site. 

3. Lipoprotein: Studies of the cleavage sites of twenty-six bacterial \ 
lipoprotein precursors has allowed the definition of a consensus amino acid sequence 
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for lipoprotein cleavage. Nearly three-fourths of the bacterial lipoprotein precursors 
examined contained the sequence L-(A,S)-(G,A)-C at positions -3 to +1, relative to 
the point of cleavage (Hayashi, S. and Wu, H. C, 7. Bioenerg, Biomembr, 22:451-471 
(1990)). 

5 4. LPXTG motif: It has been experimentally determined that most anchored 

proteins found on the surface of gram-positive bacteria possess a highly conserved 
carboxy terminal sequence. More than fifty such proteins from organisms such as S. 
pyogenes, S. mutans, E.faecalis, S, pneumoniae, and others, have been identified based 
on their extracellular location and carboxy terminal amino acid sequence (Fischetti, V. 

10 A., ASM News 62:405-4 10(1 996)) . The conserved region consists of six charged 
amino acids at the extreme carboxy terminus coupled to 15-20 hydrophobic amino 
acids presumed to function as a transmembrane domain. Immediately adjacent to the 
transmembrane domain is a six amino acid sequence conserved in nearly all proteins 
examined. The amino acid sequence of this region is L-P-X-T-G-X, where X is any 

15 amino acid. 

An algorithm for selecting antigenic and immunogenic Enterococcus faecalis 
polypeptides including the foregoing criteria was developed. The algorithm is similar 
to that described in U.S. patent application 08/781,986, filed January 3, 1997, which 
is fully incorporated by reference herein. Use of the algorithm by the inventors to 
20 select immunologically useful Enterococcus faecalis polypeptides resulted in the 
selection of a number of the disclosed ORFs. Polypeptides comprising the 
polypeptides identified in this group may be produced by techniques standard in the 
art and as further described herein. 

25 Nucleic Acid Molecules 

Sequenced E. faecalis genomic DNA was obtained from the E, faecalis strain V586. As 
discussed elsewhere hererin, polynucleotides of the present invention readily may be 
obtained by routine application of well known and standard procedures for cloning 
and sequencing DNA. Detailed methods for obtaining libraries and for sequencing are 
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provided below, for instance. A wide variety of Enterococcus faecalis strains that can 
be used to prepare E. faecalis genomic DNA for cloning and for obtaining 
polynucleotides and polypeptides of the present invention. A wide variety of 
Enterococcus faecalis strains are available to the pubhc from recognized depository 
5 institutions, such as the American Type Culture Collection (ATCC). It is recognized 
that minor variation is the nucleic acid and amino acid sequence may be expected from 
E faecalis strain to strain. The present invention provides for genes, including both 
polynucleotides and polypeptides, of the of the present invention from all the 
Enterococcus faecalis strains. 

10 Unless otherwise indicated, all nucleotide sequences determined by sequencing 

a DNA molecxile herein were determined using an automated DNA sequencer (such as 
the Model 373 from Applied Biosystems, Inc., Foster City, CA), and all amino acid 
sequences of polypeptides encoded by DNA molecules determined herein were 
predicted by translation of a DNA sequence determined as above. Therefore, as is 

15 known in the art for any DNA sequence determined by this automated approach, any 
nucleotide sequence determined herein may contain some errors. Nucleotide 
sequences determined by automation are typically at least about 90% identical, more 
typically at least about 95% to at least about 99.9% identical to the actual nucleotide 
sequence of the sequenced DNA molecule. The actual sequence can be more 

20 precisely determined by other approaches including manual DNA sequencing methods 
well known in the art. As is also known in the art, a single insertion or deletion in a 
determined nucleotide sequence compared to the actual sequence will cause a frame 
shift in translation of the nucleotide sequence such that the predicted amino acid 
sequence encoded by a determined nucleotide sequence will be completely different 

25 from the amino acid sequence actually encoded by the sequenced DNA molecule, 
beginning at the point of such an insertion or deletion. In case of conflict between 
Table 1 and either the nucleic acid sequence of the clones Hsted in Table 1 or the amino 
acid sequence of the protein expressed by the clones listed in Table I, the clones listed 
in Table I are controlling. By "nucleotide sequence" of a nucleic acid molecule or 
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polynucleotide is intended to mean either a DNA or RNA sequence.Using the 
information provided herein, such as the nucleotide sequence in Table 1 , a nucleic acid 
molecule of the present invention encoding a Kfaecalis polypeptide may be obtained 
using standard cloning and screening procedures, such as those for cloning DNAs 

5 using genomic DNA as starting material See, e.g., Sambrook et al. MOLECULAR 
CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed. 
1989); Ausubel et al, CURRENT PROTOCALS IN MOLECULAR BIOLOGY 
(John Wiley and Sons, N.Y. 1989). Illustrative of the invention, the nucleic acid 
molecule described in Table 1 was discovered in a DNA library derived from a E, 

1 0 faecalis genomic DNA. 

Nucleic acid molecules of the present invention may be in the form of RNA, 
such as mRNA, or in the form of DNA, including, for instance, DNA and genomic 
DNA obtained by cloning or produced synthetically. The DNA may be 
double-stranded or single-stranded. Single-stranded DNA or RNA may be the coding 

15 strand, also known as the sense strand, or it may be the non-coding strand, also 
referred to as the anti-sense strand. 

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, 
DNA or RNA, which has been removed from its native enviroimient. This includes 
segments of DNA comprising the E, faecalis polynucleotides of the present invention 

20 isolated from the native chromosome. These fragments include both isolated 

fragments consisting only of E. faecalis DNA and fragments comprising heterologous 
sequences such as vector sequences or other foreign DNA. For example, recombinant 
DNA molecules contained in a vector are considered isolated for the purposes of the 
present invention. Further examples of isolated DNA molecules include recombinant 

25 DNA molecules maintained in heterologous host cells or purified (partially or 

substantially) DNA molecules in solution. Isolated RNA molecules include in vivo or 
in vitro RNA transcripts of the DNA molecules of the present invention. Isolated 
nucleic acid molecules according to the present invention furftier include such 
molecules produced synthetically. 
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In addition, isolated nucleic acid molecules of the invention include DNA 
molecules which comprise a sequence substantially different from those described 
above but which, due to the degeneracy of the genetic code, still encode a faecalis 
polypeptides and peptides of the present invention (e.g. polypeptides of Table 1). 
5 That is, all possible DNA sequences that encode the E. faecalis polypeptides of the 
present invention. This includes the genetic code and species-specific codon 
preferences known in the art. Thus, it would be routine for one skilled in the art to 
generate the degenerate vztriants described above, for instance, to optimize codon 
expression for a particular host (e.g., change codons in the bacteria mRNA to those 

10 preferred by a mammalian or other bacterial host such as E, coli). 

The invention further provides isolated nucleic acid molecules having the 
nucleotide sequence shoAvn in Table 1 or a nucleic acid molecule having a sequence 
complementary to one of the above sequences. Such isolated molecules, particularly 
DNA molecules, are useful as probes for gene mapping and for identifying E. faecalis 

15 in a biological sample, for instance, by PCR, Southern blot, Northern blot, or other 
form of hybridization analysis. 

The present invention is further directed to nucleic acid molecules encoding 
portions or fragments of the nucleotide sequences described herein. Fragments include 
portions of the nucleotide sequences of Table 1, or the E, faecalis nucleotide 

20 sequences contained in the plasimd clones listed in Table 1 , at least 10 contiguous 
nucleotides in length selected from any two integers, one of which representing a 5' 
nucleotide position and a second of which representing a 3' nucleotide position, where 
the first nucleotide for each nucleotide sequence in Table 1 is position 1 . That is, 
every combination of a 5* and 3' nucleotide position that a fragment at least 10 

25 contiguous nucleotides in length could occupy is included in the invention. At least 
means a fragment may be 10 contiguous nucleotide bases in length or any integer 
between 10 and the length of an entire nucleotide sequence of Table 1 minus 1. 
Therefore, included in the invention are contiguous fragments specified by any 5' and 
3' nucleotide base positions of a nucleotide sequences of Table 1 wherein the 
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contiguous fragment is any integer between 10 and the length of an entire nucleotide 
sequence minus 1 . 

Further, the invention includes polynucleotides comprising fragments specified 
by size, in nucleotides, rather than by nucleotide positions. The invention includes 

5 any fragment size, in contiguous nucleotides, selected from integers between 10 and 
the length of an entire nucleotide sequence minus 1 . Preferred sizes of contiguous 
nucleotide fragments include 20 nucleotides, 30 nucleotides, 40 nucleotides, 50 
nucleotides. Other preferred sizes of contiguous nucleotide frugments, which may be 
useful as diagnostic probes and primers, include fragments 50-300 nucleotides in 

10 length which include, as discussed above, fragment sizes representing each integer 
between 50-300. Larger fragments are also useful according to the present invention 
corresponding to most, if not all, of the nucleotide sequences shown in Table lor of 
the E.faecalis nucleotide sequences of the plasimd clones listed in Table 1. The 
preferred sizes are, of course, meant to exemplify not limit the present invention as all 

15 size fragments, representing any integer between 10 and the length of an entire 
nucleotide sequence minus 1, are included in the invention. Additional preferred 
nucleic acid fragments of the present invention include nucleic acid molecules encoding 
epitope-bearing portions of E.faecalis polypeptides identified in Table 4, 

The present invention also provides for the exclusion of any fragment, 

20 specified by 5* and 3* base positions or by size in nucleotide bases as described above 
for any nucleotide sequence of Table 1 or the plasimd clones listed in Table 1 . Any 
number of fragments of nucleotide sequences in Table 1 or the plasimd clones listed in 
Table 1, specified by 5' and 3' base positions or by size in nucleotides, as described 
above, may be excluded from the present invention. 

25 In another aspect, the invention provides an isolated nucleic acid molecule 

comprising a polynucleotide which hybridizes xmder stringent hybridization 
conditions to a portion of a polynucleotide in a nucleic acid molecules of the invention 
described above, for instance, nucleotide sequences of Table 1 or the E.faecalis 
sequences of the plasimd clones listed in Table 1. By "stringent hybridization 
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conditions" is intended overnight incubation at 42°C in a solution comprising: 50% 
formamide, 5x SSC (150 mM NaCl, 1 5 mM trisodium citrate), 50 mM sodium 
phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran sulfate, and 20 |ig/ml 
denatured, sheared salmon sperm DNA, followed by washing the filters in 0.1 x SSC at 
5 about 65 °C. 

By a polynucleotide which hybridizes to a "portion" of a polynucleotide is 
intended a polynucleotide (either DNA or RNA) hybridizing to at least about 15 
nucleotides bases, and more preferably at least about 20 nucleotides bases, still more 
preferably at least about 30 riucleotides bases, and even more preferably about 30-70 

10 (e.g., 50) nucleotides bases of the reference polynucleotide. These are useful as 

diagnostic probes and primers as discussed above. By a portion of a polynucleotide 
of "at least 20 nucleotides bases in length," for example, is intended 20 or more 
contiguous nucleotides bases nucleotides from the nucleotide sequence of the reference 
polynucleotide (e.g., the nucleotide sequence as shown in Table 1). Portions of a 

15 polynucleotide which hybridizes to a nucleotide sequence in Table 1, which can be 
used as probes and primers, may also be precisely specified by 5' and 3* base 
positions or by size in nucleotide bases as described above or precisely excluded in the 
same manner. 

The nucleic acid molecules of the present invention include those encoding the 
20 full length E. faecalis polypeptides of Table 1 and portions of the E. faecalis 
polypeptides of Table 1 . Also included in the present invention are nucleic acids 
encoding the above full length sequences and further comprise additional sequences, 
such as those encoding an added secretory leader sequence, such as a pre-, or pro- or 
prepro- protein sequence. Further included in the present invention are nucleic acids 
25 encoding the above full length sequences and portions thereof and further comprise 
additional heterologous amino acid sequences encoded by nucleic acid sequences from 
a different soiirce. 

Also included in the present invention are nucleic acids encoding the above 
protein sequences together with additional, non-coding sequences, including for 
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example, but not limited to non-coding 5' and 3' sequences. These sequences include 
transcribed, non-translated sequences that may play a role in transcription, and 
mRNA processing, for example, ribosome binding and stability of mRNA. Also 
included in the present invention are additional coding sequences which provide 

5 additional functionalities. 

Thus, a nucleotide sequence encoding a polypeptide may be fused to a marker 
sequence, such as a sequence encoding a peptide which facilitates purification of the 
fused polypeptide. In certain preferred embodiments of this aspect of the invention, 
the marker amino acid sequence is a hexa-histidine peptide, such as the tag provided in 

10 a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311), among 
others, many of which are commercially available. For instance, hexa-histidine 
provides for convenient purification of the fusion protein. See Gentz et al. (1989) 
Proc. Natl. Acad. Sci. 86:821-24. The "HA" tag is another peptide useful for 
purification which corresponds to an epitope derived from the influenza hemagglutinin 

15 protein. See Wilson et al. (1 984) Cell 37:767. As discussed below, other such fusion 
proteins include the E. faecalis polypeptides of the present invention fused to Fc at 
the N- or C-tenninus. 

Variant and Mutant Polynucleotides 
20 The present invention further relates to variants of the nucleic acid molecules 

which encode portions, analogs or derivatives of a E, faecalis polypeptides of Table 1 

and variant polypeptides thereof including portions, analogs, and derivatives of the E. 

faecalis polypeptides. Variants may occur naturally, such as a natural allelic variant. 

By an "allelic variant" is intended one of several altemate forms of a gene occupying a 
25 given locus on a chromosome of an organism. See, e.g., B. Lewin, Genes IV (1990). 

Non-naturally occurring variants may be produced using art-knovra mutagenesis 

techniques. 

Such nucleic acid variants include those produced by nucleotide substitutions, 
deletions, or additions. The substitutions, deletions, or additions may involve one or 
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more nucleotides. The variants may be altered in coding regions, non-coding regions, 
or both. Alterations in the coding regions may produce conservative or 
non-conservative amino acid substitutions, deletions or additions. Especially 
preferred among these are silent substitutions, additions and deletions, which do not 
5 alter the properties and activities of a E.faecalis protein of the present invention or 
portions thereof. Also especially preferred in this regard are conservative 
substitutions. 

Such polypeptide variants include those produced by amino acid 
substitutions, deletions or additions. The substitutions, deletions, or additions may 

10 involve one or more residues. Alterations may produce conservative or 

non-conservative amino acid substitutions, deletions, or additions. Especially 
preferred among these are silent substitutions, additions and deletions, which do not 
alter the properties and activities of a E:faecalis protein of the present invention or 
portions thereof. Also especially preferred in this regard are conservative 

15 substitutions. 

The present invention also relates to recombinant vectors, which include the 
isolated nucleic acid molecules of the present invention, and to host cells containing 
the recombinant vectors, as well as to methods of making such vectors and host cells 
and for using them for production of E, faecalis polypeptides or peptides by 

20 recombinant techniques. 

The present application is directed to nucleic acid molecules at least 90%, 
95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequence shown in Table 1 . 
The above nucleic acid sequences are included irrespective of whether they encode a 
polypeptide having E, faecalis activity. This is because even where a particular 

25 nucleic acid molecule does not encode a polypeptide having E, faecalis activity, one of 
skill in the art would still know how to use the nucleic acid molecule, for instance, as a 
hybridization probe. Uses of the nucleic acid molecules of the present invention that 
do not encode a polypeptide having E, faecalis activity include, inter alia^ isolating an 
E, faecalis gene or allelic variants thereof from a DNA library, and detecting E. faecalis 



wo 98/50554 



.20- 



PCT/US98/08959 



mRNA expression samples, environmental samples, suspected of containing E. 
faecalis by Northern Blot analysis. 

Preferred, are nucleic acid molecules having sequences at least 90%, 95%, 96%, 
97%, 98% or 99% identical to the nucleic acid sequence shown in Table 1, v^hich do, 
5 in fact, encode a polypeptide having E, faecalis protein activity By "a polypeptide 
having E, faecalis activity" is intended polypeptides exhibiting activity similar, but 
not necessarily identical, to an activity of the E, faecalis protein of the invention, as 
measured in a particular biological assay suitable for measuring activity of the 
specified protein. 

10 Due to the degeneracy of the genetic code, one of ordinary skill in the art will 

immediately recognize that a large number of the nucleic acid molecules having a 
sequence at least 90%, 95%, 96%, 97%, 98%, or 99% identical to the nucleic acid 
sequences shown in Table 1 will encode a polypeptide having E, faecalis protein 
activity. In fact, since degenerate variants of these nucleotide sequences all encode the 

15 same polypeptide, this will be clear to the skilled artisan even Avithout performing the 
above described comparison assay. It will be further recognized in the art that, for 
such nucleic acid molecules that are not degenerate variants, a reasonable nimiber will 
also encode a polypeptide having E. faecalis protein activity. This is because the 
skilled artisan is fully aware of amino acid substitutions that are either less likely or 

20 not likely to significantly effect protein function (e.g., replacing one aliphatic amino 
acid with a second aliphatic amino acid), as further described below. 

The biological activity or function of the polypeptides of the present 
invention are expected to be similar or identical to polypeptides from other bacteria 
that share a high degree of structural identity/similarity. Tables 2 lists accession 

25 numbers and descriptions for the closest matching sequences of polypeptides 

available through Genbank and Derwent databases. It is therefore expected that the 
biological activity or function of the polypeptides of the present invention will be 
similar or identical to those polypeptides from other bacterial genuses, species, or 
strains listed in Table 2. 
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By a polynucleotide having a nucleotide sequence at least, for example, 95% 
"identical" to a reference nucleotide sequence of the present invention, it is intended 
that the nucleotide sequence of the polynucleotide is identical to the reference 
sequence except that the polynucleotide sequence may include up to five point 
5 mutations per each 1 00 nucleotides of the reference nucleotide sequence encoding the 
E.faecalis polypeptide. In other words, to obtain a polynucleotide having a 
nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 
5% of the nucleotides in the reference sequence may be deleted, inserted, or 
substituted with another nucleotide. The query sequence may be an entire sequence 
10 shown in Table 1 , the ORF (open reading frame), or any fragment specified as 
described herein. 

As a practical matter, whether any particular nucleic acid molecule or 
polypeptide is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide 
sequence of the presence invention can be determined conventionally using known 

1 5 computer programs. A preferred method for determining the best overall match 
between a query sequence (a sequence of the present invention) and a subject 
sequence, also referred to as a global sequence alignment, can be determined using the 
FASTDB computer program based on the algorithm of Brutlag et al. See Bmtlag et 
al. (1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the query and 

20 subject sequences are both DNA sequences. An RNA sequence can be compared by 
first converting U's to T's. The result of said global sequence alignment is in percent 
identity. Preferred parameters used in a FASTDB alignment of DNA sequences to 
calculate percent identity are: Matrix=Unitary, k-tuple=4, Mismatch Penalty=l , 
Joining Penalty=30, Randomization Group Length=0, Cutoff Score=l, Gap 

25 Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the lenght of the subject 
nucleotide sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence because of 5' or 3 ' 
deletions, not because of internal deletions, a manual correction must be made to the 
results. This is because the FASTDB program does not account for 5' and 3' 
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truncations of the subject sequence when calculating percent identity. For subject 
sequences truncated at the 5' or 3' ends, relative to the query sequence, the percent 
identity is corrected by calculating the number of bases of the query sequence that are 
5' and 3' of the subject sequence, which are not matched/aligned, as a percent of the 
5 total bases of the query sequence. Whether a nucleotide is matched/aligned is 

determined by results of the FASTDB sequence alignment. This percentage is then 
subtracted from the percent identity, calculated by the above FASTDB program using 
the specified parameters, to arrive at a final percent identity score. This corrected 
^ score is what is used for the purposes of the present invention. Only nucleotides 

10 outside the 5' and 3' nucleotides of the subject sequence, as displayed by the 
FASTDB alignment, which are not matched/aligned with the query sequence, are 
calculated for the purposes of manually adjusting the percent identity score. 

For example, a 90 nucleotide subject sequence is aligned to a 100 nucleotide 
query sequence to determine percent identity. The deletions occur at the 5' end of the 

15 subject sequence and therefore, the FASTDB alignment does not show a 

matched/alignment of the first 10 nucleotides at 5' end. The 10 unpaired nucleotides 
represent 10% of the sequence (number of nucleotides at the 5' and 3' ends not 
matched/total number of nucleotides in the query sequence) so 10% is subtracted fi-om 
the percent identity score calculated by ttie FASTDB program. If the remaining 90 

20 nucleotides were perfectly matched the final percent identity would be 90%. In 

another example, a 90 nucleotide subject sequence is compared with a 100 nucleotide 
query sequence. This time the deletions are internal deletions so that there are no 
nucleotides on the 5' or 3' of the subject sequence which are not matched/aligned with 
the query. In this case the percent identity calculated by FASTDB is not manually 

25 corrected. Once again, only nucleotides 5 * and 3' of the subject sequence which are 
not matched/aHgned v^th the query sequence are manually corrected for. No other 
manual corrections are to made for the purposes of the present invention. 

Vectors and Host Cell 
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The present invention also relates to vectors which include the isolated DN A 
molecules of the present invention, host cells comprising the recombinant vectors, and 
the production of E.faecalis polypeptides and peptides of the present invention 
expressed by the host cells. 
5 Recombinant constructs may be introduced into host cells using well known 

techniques such as infection, transduction, transfection, transvection, electroporation 
and transformation. The vector may be, for example, a phage, plasmid, viral or 
retroviral vector. Retroviral vectors may be replication competent or replication 
defective. In the latter case, viral propagation generally will occur only in 
1 0 complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable marker 
for propagation in a host. Generally, a plasmid vector is introduced in a precipitate, 
such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the 
vector is a vims, it may be packaged in vitro using an appropriate packaging cell line 
15 and then transduced into host cells. 

Preferred are vectors comprising cw-acting control regions to the 
polynucleotide of interest. Appropriate ^ra/^5-acting factors may be supplied by the 
host, supplied by a complementing vector or supplied by the vector itself upon 
introduction into the host. 
20 In certain preferred embodiments in this regard, the vectors provide for 

specific expression, which may be inducible and/or cell type-specific. Particularly 
preferred among such vectors are those inducible by environmental factors that are 
easy to manipulate, such as temperature and nutrient additives. 

Expression vectors useful in the present invention include chromosomal-, 
25 episomal- and virus-derived vectors, e.g., vectors derived from bacterial plasmids, 
bacteriophage, yeast episomes, yeast chromosomal elements, viruses such as 
baculoviruses, papova viruses, vaccinia viruses, adenoviruses, fowl pox viruses, 
pseudorabies viruses and retroviruses, and vectors derived from combinations thereof, 
such as cosmids and phagemids. 
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The DNA insert should be operatively linked to an appropriate promoter, 
such as the phage lambda PL promoter, the E. coli lac, trp and tac promoters, the 
SV40 early and late promoters and promoters of retroviral LTRs, to name a few. 
Other suitable promoters will be knoAvn to the skilled artisan. The expression 
5 constructs will further contain sites for transcription initiation, termination and, in the 
transcribed region, a ribosome binding site for translation. The coding portion of the 
mature transcripts expressed by the constructs will preferably include a translation 
initiating site at the beginning and a termination codon (UAA, UGA or UAG) 
appropriately positioned at the end of the polypeptide to be translated. 

10 As indicated, the expression vectors will preferably include at least one 

selectable marker. Such markers include dihydrofolate reductase or neomycin 
resistance for eukaryotic cell culture and tetracycline, kanamycin, or ampicillin 
resistance genes for culturing in E, coli and other bacteria. Representative examples of 
appropriate hosts include, but are not limited to, bacterial cells, such as E, coli, 

15 Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect 
cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS 
and Bowes melanoma cells; and plant cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE9, 

20 pQElO available from Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors, 
pNH8A, pNH16a, pNHlSA, pNH46A available from Stratagene; pET series of 
vectors available from Novagen; and ptrc99a, pKK223-3, pKK233-3, pDR540, 
pRIT5 available from Pharmacia. Among preferred eukaryotic vectors are pWLNEO, 
pSV2CAT, pOG44, pXTl and pSG available from Stratagene; andpSVK3, pBPV, 

25 pMSG and pSVL available from Pharmacia. Other suitable vectors will be readily 
apparent to the skilled artisan. 

Among known bacterial promoters suitable for use in the present invention 
include the £. coli lad and lacZ promoters, the T3, T5 and T7 promoters, the gpt 
promoter, the lambda PR and PL promoters and the trp promoter. Suitable eukaryotic 
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promoters include the CMV immediate early promoter, the HSV thymidine kinase 
promoter, the early and late SV40 promoters, the promoters of retroviral LTRs, such 
as those of the Rous sarcoma virus (RSV), and metallothionein promoters, such as the 
mouse metallothionein-I promoter. 
5 Introduction of the construct into the host cell can be effected by calcium 

phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated 
transfection, electroporation, transduction, infection or other methods. Such methods 
are described in many standard laboratory manuals (for example, Davis, et al, Basic 
Methods In Molecular Biology ( 1 986)) . 

10 Transcription of DNA encoding the polypeptides of the present invention by 

higher eukaryotes may be increased by inserting an enhancer sequence into the vector. 
Enhancers are cw-acting elements of DNA, usually about from 10 to 300 nucleotides 
that act to increase transcriptional activity of a promoter in a given host cell-type. 
Examples of enhancers include the SV40 enhancer, which is located on the late side of 

15 the replication origin at nucleotides 100 to 270, the cytomegalovirus early promoter 
enhancer, the polyoma enhancer on the late side of the replication origin, and 
adenovirus enhancers. 

For secretion of the translated polypeptide into the lumen of the endoplasmic 
reticulum, into the periplasmic space or into the extracellular environment, 

20 appropriate secretion signals may be incorporated into the expressed polypeptide, for 
example, the amino acid sequence KDEL. The signals may be endogenous to the 
polypeptide or they may be heterologous signals. 

The polypeptide may be expressed in a modified form, such as a fusion 
protein, and may include not only secretion signals, but also additional heterologous 

25 functional regions. For instance, a region of additional amino acids, particularly 

charged amino acids, may be added to the N-terminus of the polypeptide to improve 
stability and persistence in the host cell, during purification, or during subsequent 
handling and storage. Also, peptide moieties may be added to the polypeptide to 
facilitate purification. Such regions may be removed prior to final preparation of the 
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polypeptide. The addition of peptide moieties to polypeptides to engender secretion 
or excretion, to improve stability and to facilitate purification, among others, are 
familiar and routine techniques in the art. A preferred fusion protein comprises a 
heterologous region from immunoglobulin that is useful to solubilize proteins. For 
5 example, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion proteins 
comprising various portions of constant region of in^unoglobulin molecules together 
with another human protein or part thereof. In many cases, the Fc part in a fusion 
protein is thoroughly advantageous for use in therapy and diagnosis and thus results, 
for example, in improved pharmacokinetic properties (EP-A 0232 262). On the other 

10 hand, for some uses it would be desirable to be able to delete the Fc part after the 
fusion protein has been expressed, detected and piu-ified in the advantageous manner 
described. This is the case when Fc portion proves to be a hindrance to use in 
therapy and diagnosis, for example when the fusion protein is to be used as antigen for 
immunizations. In drug discovery, for example, human proteins, such as, 

15 hIL5-receptor has been fused with Fc portions for the purpose of high-throughput 
screening assays to identify antagonists of hIL-5. See Bennett, D. et al. (1995) J. 
Molec. Recogn. 8:52-58 and Johanson, K. et al. (1995) J. Biol. Chem. 270 
(16):9459-9471. 

The E.faecalis polypeptides can be recovered and purified from recombinant 
20 cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography, lectin chromatography and high 
performance liquid chromatography ("HPLC") is employed for purification. 
25 Polypeptides of the present invention include naturally purified products, products of 
chemical synthetic procedures, and products produced by recombinant techniques 
from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher 
plant, insect and mammalian cells. 
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Polypeptides and Fragments 

The invention ftirther provides an isolated Kfaecalis polypeptide having an 
amino acid sequence in Table 1 , or a peptide or polypeptide comprising a portion of 
the above polypeptides. 

5 

Variant and Mutant Polypeptides 

To improve or alter the characteristics of E,faecalis polypeptides of the 
present invention, protein engineering may be employed. Recombinant DNA 
technology known to those skilled in the art can be used to create novel mutant 
10 proteins or muteins including single or multiple amino acid substitutions, deletions, 
additions, or fusion proteins. Such modified polypeptides can show, e.g., enhanced 
activity or increased stability. In addition, they may be purified in higher yields and 
show better solubility than the corresponding natural polypeptide, at least under ' 
certain purification and storage conditions. 

15 

N'Terminal and C-Terminal Deletion Mutants 

It is known in the art that one or more amino acids may be deleted from the 
N-terminus or C-terminus without substantial loss of biological function. For 
instance, Ron et al. J. Biol. Chem., 268:2984-2988 (1993), reported modified KGF 

20 proteins that had heparin binding activity even if 3, 8, or 27 N-terminal amino acid 
residues were missing. Accordingly, the present invention provides polypeptides 
having one or more residues deleted from the amino terminus of the amino acid 
sequence of the E.faecalis polypeptides shown in Table 1, and polynucleotides 
encoding such polypeptides. 

25 Similarly, many examples of biologically ftinctional C-terminal deletion 

muteins are known. For instance. Interferon gamma shows up to ten times higher 
activities by deleting 8-10 amino acid residues from the carboxy terminus of the 
protein iJee, e.g., Dobeli, et al. (1988) J. Biotechnology 7:199-216. Accordingly, the 
present invention provides polypeptides having one or more residues from the 
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carboxy terminus of the amino acid sequence of the E.faecalis polypeptides shown in 
Table 1 . The invention also provides polypeptides having one or more amino acids 
deleted from both the amino and the carboxyl tennini as described below. 

The present invention is further directed to polynucleotide encoding portions 
5 or fragments of the amino acid sequences described herein as well as to portions or 
fragments of the isolated amino acid sequences described herein. Fragments include 
portions of the amino acid sequences of Table 1, are at least 5 contiguous amino acid 
in length, are selected from any two integers, one of which representing a N-terminal 
position. The initiation codon of the polypeptides of the present inventions position 

10 1 . Every combination of a N-terminal and C-terminal position that a fragment at least 
5 contiguous amino acid residues in length could occupy, on any given amino acid 
sequence of Table 1 is included in the invention. At least means a fragment may be 5 
contiguous amino acid residues in length or any integer between 5 and the number of 
residues in a full length amino acid sequence minus 1 . Therefore, included in the 

15 invention are contiguous fragments specified by any N-terminal and C-terminal 

positions of amino acid sequence set forth in Table 1 wherein the contiguous fragment 
is any integer between 5 and the number of residues in a full length sequence minus 1. 

Further, the invention includes polypeptides comprising fragments specified 
by size, in amino acid residues, rather than by N-terminal and C-terminal positions. 

20 The invention includes any fragment size, in contiguous amino acid residues, selected 
from integers between 5 and the number of residues in a full length sequence minus 1 . 
Preferred sizes of contiguous polypeptide fragments include about 5 amino acid 
residues, about 10 amino acid residues, about 20 amino acid residues, about 30 amino 
acid residues, about 40 amino acid residues, about 50 amino acid residues, about 100 

25 amino acid residues, about 200 amino acid residues, about 300 amino acid residues, 
and about 400 amino acid residues. The preferred sizes are, of course, meant to 
exemplify, not limit, the present invention as all size fragments representing any 
integer between 5 and the number of residues in a full length sequence minus 1 are 
included in the invention. The present invention also provides for the exclusion of any 
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fragments specified by N-terminal and C-terminal positions or by size in amino acid 
residues as described above. Any number of fragments specified by N-terminal and 
C-terminal positions or by size in amino acid residues as described above may be 
excluded. 

5 The above fragments need not be active since they would be useful, for 

example, in immunoassays, in epitope mapping, epitope tagging, to generate 
antibodies to a particular portion of the protein, as vaccines, and as molecular weight 
markers. 

10 Other Mutants 

In addition to N- and C-terminal deletion forms of the protein discussed above, 
it also will be recognized by one of ordinary skill in the art that some amino acid 
sequences of the E. faecalis polypeptide can be varied v^thout significant effect of the 
structure or function of the protein. If such differences in sequence are contemplated, 

1 5 it should be remembered that there will be critical areas on the protein which 
determine activity. 

Thus, the invention further includes variations of the E, faecalis polypeptides 
which show substantial E. faecalis polypeptide activity or which include regions of E, 
faecalis protein such as the protein portions discussed below. Such mutants include 

20 deletions, insertions, inversions, repeats, and type substitutions selected according to 
general rules known in the art so as to have little effect on activity. For example, 
guidance concerning how to make phenotypically silent amino acid substitutions is 
provided. There are two main approaches for studying the tolerance of an amino acid 
sequence to change. See, Bowie, J. U. et al (1990), Science 247:1306-1310. The first 

25 method relies on the process of evolution, in which mutations are either accepted or 
rejected by natural selection. The second approach uses genetic engineering to 
introduce amino acid changes at specific positions of a cloned gene and selections or 
screens to identify sequences that maintain functionality. 

These studies have revealed that proteins are surprisingly tolerant of amino 
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acid substitutions. The studies indicate which amino acid changes are likely to be 
permissive at a certain position of the protein. For example, most buried amino acid 
residues require nonpolar side chains, whereas few features of surface side chains are 
generally conserved. Other such phenotypically silent substitutions are described by 
5 Bowie et al. {supra) and the references cited therein. Typically seen as conservative 
substitutions are the replacements, one for another, among the aliphatic amino acids 
Ala, Val, Leu and He; interchange of the hydroxyl residues Ser and Thr, exchange of 
the acidic residues Asp and Glu, substitution between the amide residues Asn and 
Gin, exchange of the basic residues Lys and Arg and replacements among the aromatic 

10 residues Phe, Tyr. 

Thus, the fragment, derivative, analog, or homolog of the polypeptide of Table 
1 , or that encoded by the plaimds listed in Table 1, may be: (i) one in which one or 
more of the amino acid residues are substituted with a conserved or non-conserved 
amino acid residue (preferably a conserved amino acid residue) and such substituted 

15 amino acid residue may or may not be one encoded by the genetic code: or (ii) one in 
which one or more of the amino acid residues includes a substituent group: or (iii) one 
in which the E, faecalis polypeptide is fused with another compound, such as a 
compound to increase the half-life of the polypeptide (for example, polyethylene 
glycol): or (iv) one in which the additional amino acids are fused to the above form of 

20 the polypeptide, such as an IgG Fc fusion region peptide or leader or secretory 

sequence or a sequence which is employed for purification of the above form of the 
polypeptide or a proprotein sequence. Such fragments, derivatives and analogs are 
deemed to be within the scope of those skilled in the art from the teachings herein. 

Thus, the E. faecalis polypeptides of the present invention may include one or 

25 more amino acid substitutions, deletions, or additions, either from natural mutations or 
human manipulation. As indicated, changes are preferably of a minor nature, such as 
conservative amino acid substitutions that do not significantly affect the folding or 
activity of the protein (see Table 3). 

Amino acids in the E, faecalis proteins of the present invention that are 
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essential for function can be identified by methods known in the art, such as site- 
directed mutagenesis or alanine-scanning mutagenesis. See, e.g., Cunningham et al. 
(1989) Science 244:1081-1085. The latter procedure introduces single alanine 
mutations at every residue in the molecule. The resulting mutant molecules are then 
5 tested for biological activity using assays appropriate for measiuing the function of 
the particular protein. 

Of special interest are substitutions of charged amino acids with other charged 
or neutral amino acids which may produce proteins with highly desirable improved 
characteristics, such as less aggregation. Aggregation may not only reduce activity but 

10 also be problematic when preparing pharmaceutical formulations, because aggregates 
can be immunogenic. See, e.g., Pinckard et al., (1967) Clin. Exp. Immunol. 2:331-340; 
Robbins, et al., (1987) Diabetes 36:838-845; Cleland, et al, (1993) Crit. Rev. 
Therapeutic Ehixg Carrier Systems 10:307-377. 

The polypeptides of the present invention are preferably provided in an 

15 isolated form, and preferably are substantially purified. A recombinantly produced 
version of the E. faecalis polypeptide can be substantially purified by the one-step 
method described by Smith et al. (1988) Gene 67:31-40. Polypeptides of the 
invention also can be purified from natural or recombinant sources using antibodies 
directed against the polypeptides of the invention in methods which are well knovm in 

20 the art of protein purification. 

The invention further provides for isolated faecalis polypeptides 
comprising an amino acid sequence selected from the group consisting of: (a) the 
amino acid sequence of a full-length E. faecalis polypeptide having the complete 
amino acid sequence shown in Table 1 ; (b) the amino acid sequence of a full-length 

25 faecalis polypeptide having the complete amino acid sequence shown in Table 1 
excepting the N-terminal methionine; (c) the complete amino acid sequence encoded 
by the plaimds listed in Table 1; and (d) the complete amino acid sequence excepting 
the N-terminal methionine encoded by the plaimds listed in Table 1 . The 
polypeptides of the present invention also include polypeptides having an amino acid 
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sequence at least 80% identical, more preferably at least 90% identical, and still more 
preferably 95%, 96%, 97%, 98% or 99% identical to those described in (a), (b), (c), 
and (d) above. 

Further polypeptides of the present invention include polypeptides which 

5 have at least 90% similarity, more preferably at least 95% similarity, and still more 
preferably at least 96%, 97%, 98% or 99% similarity to those described above. 

A further embodiment of the invention relates to a polypeptide which 
comprises the amino acid sequence of a E.faecalis polypeptide having an amino acid 
sequence which contains at least one conservative amino acid substitution, but not 

10 more than 50 conservative amino acid substitutions, not more than 40 conservative 
amino acid substitutions, not more than 30 conservative amino acid substitutions, and 
not more than 20 conservative amino acid substitutions. Also provided are 
polypeptides which comprise the amino acid sequence of a E.faecalis polypeptide, 
having at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 conservative amino 

15 acid substitutions. 

By a polypeptide having an amino acid sequence at least, for example, 95% 
"identical" to a query amino acid sequence of the present invention, it is intended that 
the amino acid sequence of the subject polypeptide is identical to the query sequence 
except that the subject polypeptide sequence may include up to five amino acid 

20 alterations per each 100 amino acids of the query amino acid sequence. In other 

words, to obtain a polypeptide having an amino acid sequence at least 95% identical 
to a query amino acid sequence, up to 5% of the amino acid residues in the subject 
sequence may be inserted, deleted, (indels) or substituted with another amino acid. 
These alterations of the reference sequence may occur at the amino or carboxy 

25 terminal positions of the reference amino acid sequence or anywhere between those 
terminal positions, interspersed either individually among residues in the reference 
sequence or in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 
95%, 96%, 97%, 98% or 99% identical to, for instance, the amino acid sequences 
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shown in Table 1 or to the amino acid sequence encoded by the plaimds listed in Table 
1 can be determined conventionally using known computer programs. A preferred 
method for detennining the best overall match between a query sequence (a sequence 
of the present invention) and a subject sequence, also referred to as a global sequence 
5 alignment, can be determined using the FASTDB computer program based on the 
algorithm of Brutlag et al., (1990) Comp. App. Biosci. 6:237-245. In a sequence 
alignment the query and subject sequences are both amino acid sequences. The result 
of said global sequence alignment is in percent identity. Preferred parameters used in a 
FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2, Mismatch 

1 0 Penalty^ 1 , Joining Pena]ty=20, Randomization Group Length=0, Cutoff Score= 1 , 
Window Si2e=sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window 
Size=500 or the length of the subject amino acid sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C- 
terminal deletions, not because of internal deletions, the results, in percent identity, 

15 must be manually corrected. This is because the FASTDB program does not account 
for N- and C-terminal truncations of the subject sequence when calculating global 
percent identity. For subject sequences truncated at the N- and C-termini, relative to 
the query sequence, the percent identity is corrected by calculating the number of 
residues of the query sequence that are N- and C-terminal of the subject sequence, 

20 which are not matched/aligned with a corresponding subject residue, as a percent of 
the total bases of the query sequence. Whether a residue is matched/aligned is 
determined by results of the FASTDB sequence alignment. This percentage is then 
subtracted from the percent identity, calculated by the above FASTDB program using 
the specified parameters, to arrive at a final percent identity score. This final percent 

25 identity score is what is used for the purposes of the present invention. Only 
residues to the N- and C-termini of the subject sequence, which are not 
matched/aligned with the query sequence, are considered for the purposes of manually 
adjusting the percent identity score. That is, only query amino acid residues outside 
the farthest N- and C-terminal residues of the subject sequence. 
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For example, a 90 amino acid residue subject sequence is aligned with a 100 
residue query sequence to determine percent identity. The deletion occurs at the N- 
terminus of the subject sequence and therefore, the FASTDB alignment does not 
match/align with the first 10 residues at the N-terminus. The 10 unpaired residues 
5 represent 10% of the sequence (nxmiber of residues at the N- and C- termini not 
matched/total number of residues in the query sequence) so 10% is subtracted from 
the percent identity score calculated by the FASTDB program. If the remaining 90 
residues were perfectly matched the final percent identity would be 90%. In another 
example, a 90 residue subject sequence is compared with a 100 residue queiy 

10 sequence, This time the deletions are internal so there are no residues at the N- or C- 
termini of the subject sequence which are not matched/aligned with the query. In this 
case the percent identity calculated by FASTDB is not manually corrected. Once 
again, only residue positions outside the N- and C-terminal ends of the subject 
sequence, as displayed in the FASTDB alignment, which are not matched/ahgned 

15 with the query sequence are manually corrected. No other manual corrections are to 
made for the purposes of the present invention. 

The above polypeptide sequences are included irrespective of whether they 
have their normal biological activity. This is because even where a particular 
polypeptide molecule does not have biological activity, one of skill in the art would 

20 still know how to use the polypeptide, for instance, as a vaccine or to generate 

antibodies. Other uses of the polypeptides of the present invention that do not have 
E,faecaUs activity include, inter alia, as epitope tags, in epitope mapping, and as 
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration 
columns using methods known to those of skill in the art. 

25 As described below, the polypeptides of the present invention can also be 

used to raise polyclonal and monoclonal antibodies, which are useful in assays for 
detecting E.faecalis protein expression or as agonists and antagonists capable of 
enhancing or inhibiting Kfaecalis protein function. Further, such polypeptides can be 
used in the yeast two-hybrid system to "capture" E. faecalis protein binding proteins 
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which are also candidate agonists and antagonists according to the present invention. 
See. e.g.. Fields et al. (1989) Nature 340:245-246. 

Epitope-Bearing Portions 
5 In another aspect, the invention provides peptides and polypeptides 

comprising epitope-bearing portions of the E.faecalis polypeptides of the present 
invention. These epitopes are immunogenic or antigenic epitopes of the polypeptides 
of the present invention. An "immunogenic epitope" is defined as a part of a protein 
that elicits an antibody response when the whole protein or polypeptide is the 

10 immunogen. These immunogenic epitopes are beheved to be confined to a few loci on 
the molecule. On the other hand, a region of a protein molecule to which an antibody 
can bind is defined as an "antigenic determinant" or "antigenic epitope." The number 
of immunogenic epitopes of a protein generally is less than the number of antigenic 
epitopes. See, e.g., Geysen, et al. (1983) Proc. Natl. Acad. Sci. USA 81:3998- 4002. 

15 Predicted antigenic epitopes are shown in Table 4, below. It is pointed out that Table 
4 only lists amino acid residues comprising epitopes predicted to have the highest 
degree of antigenicity. The polypeptides not listed in Table 4 and portions of 
polypeptides not listed in Table 4 are not considered non-antigenic. This is because 
they may still be antigenic in vivo but merely not recognized as such by the particular 

20 algorithm used. Thus, Table 4 Hsts the amino acid residues comprising preferred 
antigenic epitopes but not a complete list. Amino acid residues comprising other 
, anigenic epitopes may be determined by algorithms similar to the Jameson- Wolf 
analysis or by in vivo testing for an antigenic response using the methods described 
herein or those known in the art. 

25 As to the selection of peptides or polypeptides bearing an antigenic epitope 

(/.e., that contain a region of a protein molecule to which an antibody can bind), it is 
well known in that art that relatively short synthetic peptides that mimic part of a 
protein sequence are routinely capable of eliciting an antiserum ttiat reacts with the 
partially mimicked protein. See, e.g., SutcUffe, et al., (1983) Science 219:660-666. 
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Peptides capable of eliciting protein-reactive sera are frequently represented in the 
primary sequence of a protein, can be characterized by a set of simple chemical rules, 
and are confined neither to immunodominant regions of intact proteins (/.e., 
immunogenic epitopes) nor to the amino or carboxyl terminals. Peptides that are 
5 extremely hydrophobic and those of six or fewer residues generally are ineffective at 
inducing antibodies that bind to the mimicked protein; longer, peptides, especially 
those containing proline residues, usually are effective. See, Sutcliffe, et al., supra^ p! 
661. For instance, 18 of 20 peptides designed according to these guidelines, containing 
8-39 residues covering 75% of the sequence of the influenza virus hemagglutinin HAl 

10 polypeptide chain, induced antibodies that reacted with the HAl protein or intact 
virus; and 12/12 peptides from the MuLV polymerase and 18/18 from the rabies 
glycoprotein induced antibodies that precipitated the respective proteins. 

Antigenic epitope-bearing peptides and polypeptides of the invention are 
therefore useful to raise antibodies, including monoclonal antibodies, that bind 

15 specifically to a polypeptide of the invention. Thus, a high proportion of hybridomas 
obtained by fiision of spleen cells from donors inununized with an antigen 
epitope-bearing peptide generally secrete antibody reactive with the native protein. 
See Sutcliffe, et al, supra, p. 663. The antibodies raised by antigenic epitope-bearing 
peptides or polypeptides are useful to detect the mimicked protein, and antibodies to 

20 different peptides may be used for tracking the fate of various regions of a protein 
precursor which undergoes post-translational processing. The peptides and 
anti-peptide antibodies may be used in a variety of qualitative or quantitative assays 
for the mimicked protein, for instance in competition assays since it has been shown 
that even short peptides {e.g., about 9 amino acids) can bind and displace the larger 

25 peptides in immunoprecipitation assays. See, e.g., Wilson, et al., (1984) Cell 
The anti-peptide antibodies of the invention also are useful for 
purification of the mimicked protein, for instance, by adsorption chromatography 
using methods known in the art. 

Antigenic epitope-bearing peptides and polypeptides of the invention 
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designed according to the above guidelines preferably contain a sequence of at least 
seven, more preferably at least nine and most preferably between about 1 0 to about 
50 amino acids (i.e. any integer between 7 and 50) contained within the amino acid 
sequence of a polypeptide of the invention. However, peptides or polypeptides 

5 comprising a larger portion of an amino acid sequence of a polypeptide of the 
invention, containing about 50 to about 100 amino acids, or any length up to and 
including the entire amino acid sequence of a polypeptide of the invention, also are 
considered epitope-bearing peptides or polypeptides of the invention and also are 
useful for inducing antibodies that react with the mimicked protein. Preferably, the 

1 0 amino acid sequence of the epitope-bearing peptide is selected to provide substantial 
solubility in aqueous solvents {i.e., the sequence includes relatively hydrophilic 
residues and highly hydrophobic sequences are preferably avoided); and sequences 
containing proline residues are particularly preferred. 

Non-limiting examples of antigenic polypeptides or peptides that can be used 

15 to generate an enterococcal-specific immune response or antibodies include portions of 
the amino acid sequences identified in Table 1 . More specifically. Table 4 discloses a 
list of non-limiting residues that are involved in the antigenicity of the epitope-bearing 
fragmients of the present invention. Therefore, the present inventions provides for 
isolatd and purified antigenic epitope-bearing fragements of the polypeptides of the 

20 present invention comprising a peptide sequences of Table 4. The antigenic epitope- 
bearing fragments comprising a peptide sequence of Table 4 preferably contain a 
sequence of at least seven, more preferably at least nine and most preferably between 
about 10 to about 50 amino acids (i.e. any integer between 7 and 50) of a polypeptide 
of the present invention. That is, included in the present invention are antigenic 

25 polypeptides between the integers of 7 and 50 amino acid in length comprising one or 
more of the sequences of Table 4. Therefore, in most cases, the polypeptides of 
Table 4 make up only a portion of the antigenic polypeptide. All combinations of 
sequences between the integers of 7 and 50 amino acid in length comprising one or 
more of the sequences of Table 4 are included. The antigenic epitope-bearing 
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fragements may be specified by either the niunber of contiguous amino acid residues 
or by specific N-terminal and C-terminal positions as described above for the 
polypeptide fragements of the present invention, wherein the initiation codon is 
residue 1 . Any number of the described antigenic epitope-bearing fragements of the 
5 present invention may also be excluded from the present invention in the same 
manner. 

The epitope-bearing peptides and polypeptides of the invention may be 
produced by any conventional means for making peptides or polypeptides including 
recombinant means using nucleic acid molecules of the invention. For instance, an 

10 epitope-bearing amino acid sequence of the present invention may be fused to a larger 
polypeptide which acts as a carrier during recombinant production and purification, as 
well as during immunization to produce anti-peptide antibodies. Epitope-bearing 
peptides also may be synthesized using known methods of chemical synthesis. For 
instance, Houghten has described a simple method for synthesis of large numbers of 

15 peptides, such as 10-20 mg of 248 different 13 residue peptides representing single 
amino acid variants of a segment of the HAl polypeptide which were prepared and 
characterized (by ELISA-type binding studies) in less than four weeks (Houghten, R. 
A. Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985)). This "Simultaneous Multiple 
Peptide Synthesis (SMPS)" process is ftu-ther described in U.S. Patent No. 4,631,21 1 

20 to Houghten and coworkers (1986). In this procedure the individual resins for the 
solid-phase synthesis of various peptides are contained in separate solvent-permeable 
packets, enabling the optimal use of the many identical repetitive steps involved in 
solid-phase methods. A completely manual procedure allows 500-1000 or more 
syntheses to be conducted simultaneously (Houghten et al. (1985) Proc. Natl. Acad. 

25 Sci. 82:5131-5135 at 5134. 

Epitope-bearing peptides and polypeptides of the invention are used to induce 
antibodies according to methods well known in the art See, e.g., Sutcliffe, et al., 
supra;; Wilson, et al., supra;; and Bittle, et al. (1985) J. Gen. Virol. 66:2347-2354. 
Generally, animals may be immunized with free peptide; however, anti-peptide 
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antibody tiler may be boosted by coupling of the peptide to a macromolecular carrier, 
such as keyhole limpet hemacyanin (KLH) or tetanus toxoid. For instance, peptides 
containing cysteine may be coupled to carrier using a linker such as 
m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may 
5 be coupled to carrier using a more general linking agent such as glutaraldehyde. 
Animals such as rabbits, rats and mice are immunized with either free or 
carrier-coupled peptides, for instance, by intraperitoneal and/or intradermal injection 
of emulsions containing about 100 |ig peptide or carrier protein and Freund's adjuvant. 
Several booster injections may be needed, for instance, at intervals of about two 

10 weeks, to provide a useful titer of anti-pep tide antibody which can be detected, for 
example, by ELISA assay using free peptide adsorbed to a soHd surface. The titer of 
anti-peptide antibodies in serum from an immunized animal may be increased by 
selection of anti-peptide antibodies, for instance, by adsorption to the peptide on a 
solid support and elution of the selected antibodies according to methods well known 

15 in the art. 

Immunogenic epitope-bearing peptides of the invention, Le,, those parts of a 
protein that elicit an antibody response when the whole protein is the immunogen, are 
identified according to methods known in the art. For instance, Geysen, et al, supra, 
discloses a procedure for rapid concurrent synthesis on solid supports of hundreds of 

20 peptides of sufficient purity to react in an ELISA. Interaction of synthesized 
peptides with antibodies is then easily detected without removing them from the 
support. In this manner a peptide bearing an immunogenic epitope of a desired 
protein may be identified routinely by one of ordinary skill in the art. For instance, 
the immunologically important epitope in the coat protein of foot-and-mouth disease 

25 virus was located by Geysen et al supra with a resolution of seven amino acids by 
synthesis of an overlapping set of all 208 possible hexapeptides covering the entire 
213 amino acid sequence of the protein. Then, a complete replacement set of peptides 
in which all 20 amino acids were substituted in turn at every position within the 
epitope were synthesized, and the particular amino acids conferring specificity for the 
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reaction with antibody were determined. Thus, peptide analogs of the epitope-bearing 
peptides of the invention can be made routinely by this method. U.S. Patent No. 
4,708,781 to Geysen (1987) further describes this method of identifying a peptide 
bearing an immunogenic epitope of a desired protein. 

5 Further still, U.S. Patent No. 5,194,392, to Geysen (1990), describes a general 

method of detecting or determining the sequence of monomers (amino acids or other 
compounds) which is a topological equivalent of the epitope (Le., a "mimotope") 
which is complementary to a particular paratope (antigen binding site) of an antibody 
of interest. More generally, U.S. Patent No. 4,433,092, also to Geysen (1989), 

10 describes a method of detecting or determining a sequence of monomers which is a 
topographical equivalent of a Ugand which is complementary to the ligand binding site 
of a particular receptor of interest. Similarly, U.S. Patent No. 5,480,971 to Houghten, 
R. A. et al (1996) discloses linear CpCy-alkyl peralkylated oligopeptides and sets and 
libraries of such peptides, as well as methods for using such oligopeptide sets and 

15 libraries for determining the sequence of a peralkylated oligopeptide that preferentially 
binds to an acceptor molecule of interest. Thus, non-peptide analogs of the 
epitope-bearing peptides of the invention also can be made routinely by these 
methods. The entire disclosure of each document cited in this section on 
"Polypeptides and Fragments" is hereby incorporated herein by reference. 

20 As one of skill in the art will appreciate, the polypepitides of the present 

invention and the epitope-bearing fragments thereof described above can be combined 
with parts of the constant domain of immunoglobulins (IgG), resulting in chimeric 
polypeptides. These fusion proteins facilitate purification and show an increased 
half-life in vivo. This has been shoAvn, e.g., for chimeric proteins consisting of the 

25 first two domains of the human CD4-polypeptide and various domains of the 

constant regions of the heavy or hght chains of mammalian immunoglobulins. (EPA 
0,394,827; Traunecker et al. (1 988) Nature 33 1 :84-86. Fusion proteins that have a 
disulfide-linked dimeric structure due to the IgG part can also be more efficient in 
binding and neutralizing other molecules than a monomeric E, faecalis polypeptide or 
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fragment thereof alone. See Fountoulakis et al. (1995) J. Biochem. 270:3958-3964. 
Nucleic acids encoding the above epitopes of E.faecalis polypeptides can also be 
recombined with a gene of interest as an epitope tag to aid in detection and 
purification of the expressed polypeptide. 

5 

Antibodies 

E.faecalis protein-specific antibodies for use in the present invention can be 
raised against the intact E.faecalis protein or an antigenic polypeptide fragment 
thereof, which may be presented together with a carrier protein, such as an albumin, to 

10 an animal system (such as rabbit or mouse) or, if it is long enough (at least about 25 
amino acids), without a carrier. 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is 
meant to include intact molecules, single chain whole antibodies, and antibody 
fragments. Antibody fragments of the present invention include Fab and F(ab')2 and 

15 other fragments including single-chain Fvs (scFv) and disulfide-linked Fvs (sdFv). 
Also included in the present invention are chimeric and humanized monoclonal 
antibodies and polyclonal antibodies specific for the polypeptides of the present 
invention. The antibodies of the present invention may be prepared by any of a 
variety of methods. For example, cells expressing a polypeptide of the present 

20 invention or an antigenic fragment thereof can be administered to an animal in order to 
induce the production of sera containing polyclonal antibodies. For example, a 
preparation of E. faecalis polypeptide or fragment thereof is prepared and purified to 
render it substantially free of natural contaminants. Such a preparation is then 
introduced into an animal in order to produce polyclonal antisera of greater specific 

25 activity. 

In a preferred method, the antibodies of the present invention are monoclonal 
antibodies or binding fragments thereof Such monoclonal antibodies can be prepared 
using hybridoma technology. See, e.g., Harlow et al., ANTIBODIES: A 
LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988); 
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Hammerling, et al, in: MONOCLONAL ANTIBODIES AND T-CELL 
HYBRIDOMAS 563-681 (Elsevier, N.Y., 1981). Fab and F(ab02 fragments may be 
produced by proteolytic cleavage, using enzymes such as papain (to produce Fab 
fragments) or pepsin (to produce F(ab*)2 fragments). Alternatively, E.faecalis 
5 polypeptide-binding fragments, chimeric, and humanized antibodies can be produced 
through the application of recombinant DNA technology or through synthetic 
chemistry using methods known in the art. 

Alternatively, additional antibodies capable of binding to the polypeptide 
antigen of the present invention may be produced in a two-step procedure through the 

10 use of anti-idiotypic antibodies. Such a method makes use of the fact that antibodies 
are themselves antigens, and that, therefore, it is possible to obtain an antibody which 
binds to a second antibody. In accordance vdth this method, E.faecalis 
polypeptide-specific antibodies are used to immunize an animal, preferably a mouse. 
The splenocyles of such an animal are then used to produce hybridoma cells, and the 

15 hybridoma cells are screened to identify clones which produce an antibody whose 
ability to bind to the E. faecalis polypeptide-specific antibody can be blocked by the 
E,faecalis polypeptide antigen. Such antibodies comprise anti-idiotypic antibodies to 
the E.faecalis polypeptide-specific antibody and can be used to immunize an animal 
to induce formation of further E.faecalis polypeptide-specific antibodies. 

20 Antibodies and fragements thereof of the present invention may be described 

by the portion of a polypeptide of die present invention recognized or specifically 
boimd by the antibody. Antibody binding fragements of a polypeptide of the present 
invention may be described or specified in the same manner as for polypeptide 
fragements discussed above., i.e, by N-terminal and C-terminal positions or by size in 

25 contiguous amino acid residues. Any number of antibody binding fragments, of a 
polypeptide of the present invention, specified by N-terminal and C-terminal 
positions or by size in amino acid residues, as described above, may also be excluded 
from the present invention. Therefore, the present invention includes antibodies the 
specifically bind a particuarlly discribed fragement of a polypeptide of the present 
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invention and allows for the exclusion of the same. 

Antibodies and fragements thereof of the present invention may also be 
described or specified in terms of their cross-reactivity. Antibodies and fragements 
that do not bind polypeptides of any other species of Enterococcus other than E. 
5 faecalis are included in the present invention. Likewise, antibodies and fragements 
that bind only species of Enterococcus, i.e. antibodies and fragements that do not bind 
bacteria from any genus other than Enterococcus, are included in the present 
invention. 

1 0 Diagnostic Assays 

The present invention further relates to methods for assaying staphylococcal 
infection in an animal by detecting the expression of genes encoding staphylococcal 
polypeptides of the present invention. The methods comprise analyzing tissue or 
body fluid from the animal for Enterococctis-spcciTxz antibodies, nucleic acids, or 

15 proteins. Analysis of nucleic acid specific to £/z/erococcM5 is assayed by PCR or 
hybridization techniques using nucleic acid sequences of the present invention as 
either hybridization probes or primers. See, e.g., Sambrook et aL Molecular cloning: 
A Laboratory Manual (Cold Spring Harbor Laboratory Press, 2nd ed., 1989, page 54 
reference); Eremeeva et al (1994) J. Clin. Microbiol. 32:803-810 (describing 

20 differentiation among spotted fever group Rickettsiae species by analysis of restriction 
fragment length polymorphism of PCR-amplified DNA) and Chen et aL 1994 J. Clin. 
Microbiol. 32:589-595 (detecting 5. burgdorferi nucleic acids via PCR). 

Where diagnosis of a disease state related to infection with Enterococcus has 
already been made, the present invention is useful for monitoring progression or 

25 regression of the disease state whereby patients exhibiting enhanced Enterococcus 

gene expression will experience a worse clinical outcome relative to patients expressing 
these gene(s) at a lower level. 

By "biological sample" is intended any biological sample obtained from an 
animal, cell line, tissue culture, or other source which contains Enterococcus 
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polypeptide, mRNA, or DNA. Biological samples include body fluids (such as saliva, 
blood, plasma, urine, mucus, synovial fluid, etc.) tissues (such as muscle, skin, and 
cartilage) and any other biological source suspected of containing Enterococcus 
polypeptides or nucleic acids. Methods for obtaining biological samples such as 
5 tissue are well known in the art. 

The present invention is useful for detecting diseases related to Enterococcus 
infections in animals. Preferred animals include monkeys, apes, cats, dogs, birds, 
cows, pigs, mice, horses, rabbits and humans. Particularly preferred are humans. 
Total RNA can be isolated from a biological sample using any suitable 

10 technique such as the single-step guanidinium-thiocyanate-phenol-chloroform method 
described in Chomczynski et al. (1987) Anal. Biochem. 162:156-159. mRNA encoding 
Enterococcus polypeptides having sufficient homology to the nucleic acid sequences 
identified in Table 1 to allow for hybridization between complementary sequences are 
then assayed using any appropriate method. These include Northern blot analysis, SI 

1 5 nuclease mapping, the polymerase chain reaction (PCR), reverse transcription in 

combination with the polymerase chain reaction (RT-PCR), and reverse transcription 
in combination with the ligase chain reaction (RT-LCR). 

Northern blot analysis can be performed as described in Harada et al. (1990) 
Cell 63:303-312. Briefly, total RNA is prepared from a biological sample as described 

20 above. For the Northern blot, the RNA is denatured in an appropriate buffer (such as 
glyoxal/dimethyl sulfoxide/sodiimi phosphate buffer), subjected to agarose gel 
electrophoresis, and transferred onto a nitrocellulose filter. After the RNAs have been 
linked to the filter by a UV linker, the filter is prehybridized in a solution containing 
formamide, SSC, Denhardt's solution, denatured sabnon sperm, SDS, and sodium 

25 phosphate buffer. A E. faecalis polynucleotide sequence shown in Table 1 labeled 
according to any appropriate method (such as the ^^P-multiprimed DNA labeling 
system (Amersham)) is used as probe. After hybridization ovemight, the filter is 
washed and exposed to x-ray film. DNA for use as probe according to the present 
invention is described in the sections above and will preferably at least 1 5 nucleotides 
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in length. 

SI mapping can be performed as described in Fujita et al. (1987) Cell 
49:357-367. To prepare probe DNA for use in SI mapping, the sense strand of an 
above-described E.faecalis DNA sequence of the present invention is used as a 
5 template to synthesize labeled antisense DNA, The antisense DNA can then be 
digested using an appropriate restriction endonuclease to generate further DNA 
probes of a desired length. Such antisense probes are useful for visualizing protected 
bands corresponding to the target mRNA (/.e., mRNA encoding Enterococcus 
polypeptides). 

10 Levels of mRNA encoding Enterococcus polypeptides are assayed, for e.g., 

using the RT-PCR method described in Makino et al. (1990) Technique 2:295-301 . 
By this method, the radioactivities of the "amplicons" in the polyacrylamide gel bands 
are linearly related to the initial concentration of the target mRNA. Briefly, this 
method involves adding total RNA isolated from a biological sample in a reaction 

15 mixture containing a RT primer and appropriate buffer. After incubating for primer 
aimealing, the mixture can be supplemented with a RT buffer, dNTPs, DTT, RNase 
inhibitor and reverse transcriptase. After incubation to achieve reverse transcription 
of the RNA, the RT products are then subject to PCR using labeled primers. 
Alternatively, rather than labeling the primers, a labeled dNTP can be included in the 

20 PCR reaction mixture. PCR ampUfication can be performed in a DNA thermal cycler 
according to conventional techniques. After a suitable number of rounds to achieve 
amplification, the PCR reaction mixture is electrophoresed on a polyacrylamide gel. 
After drying the gel, the radioactivity of the appropriate bands (corresponding to the 
mRNA encoding the Enterococcus polypeptides of the present invention) are 

25 quantified using an imaging analyzer. RT and PCR reaction ingredients and 

conditions, reagent and gel concentrations, and labeling methods are known in the 
art. Variations on the RT-PCR method will be apparent to the skilled artisan. Other 
PCR methods that can detect the nucleic acid of the present invention can be found in 
PCR PRIMER: A LABORATORY MANUAL (C.W. Dieffenbach et al. eds., Cold 
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Spring Harbor Lab Press, 1995). 

The polynucleotides of the present invention, including both DNA and RNA, 
may be used to detect polynucleotides of the present invention or Enterococcal 
species including E,faecalis using bio chip technology. The present invention 

5 includes both high density chip arrays (>1 000 oligonucleotides per cm^) and low 

density chip arrays (<1000 oligonucleotides per cm^). Bio chips comprising arrays of 
polynucleotides of the present invention may be used to detect Enterococcal species, 
including E. faecalis, in biological and environmental samples and to diagnose an 
animal, including humans, with an E.faecalis or other Enterococcal infection. The bio 

10 chips of the present invention may comprise polynucleotide sequences of other 

pathogens including bacteria, viral, parasitic, and fungal polynucleotide sequences, in 
addition to the polynucleotide sequences of the present invention, for use in rapid 
diffenertial pathogenic detection and diagnosis. The bio chips can also be used to 
monitor an E, faecalis or other Enterococcal infections and to monitor the genetic 

15 changes (deletions, insertions, mismatches, etc.) in response to drug therapy in the 
clinic and drug development in the laboratory. The bio chip technology comprising 
arrays of poljmucleotides of the present invention may also be used to simultaneously 
monitor the expression of a multiplicity of genes, including those of the present 
invention. The polynucleotides used to comprise a selected array may be specified in 

20 the same manner as for the fragements, i.e, by their 5' and 3* positions or length in 
contiguous base pairs and include from. Methods and particular uses of the 
polynucleotides of the present invention to detect Enterococcal species, including E, 
faecalis, using bio chip technology include those known in the art and those of: U.S. 
Patent Nos. 5510270, 5545531, 5445934, 5677195, 5532128, 5556752, 5527681, 

25 5451683, 5424186, 5607646, 5658732 and Worid Patent Nos. WO/9710365, 
WO/951 1995, WO/9743447, WO/9535505, each incorporated herein in their 
entireties. 

Biosensors using the polynucleotides of the present invention may also be 
used to detect, diagnose, and monitor E.faecalis or other Enterococcal species and 
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infections thereof. Biosensors using the polynucleotides of the present invention may 
also be used to detect particular polynucleotides of the present invention. Biosensors 
using the polynucleotides of the present invention may also be used to monitor the 
genetic changes (deletions, insertions, mismatches, etc.) in response to drug therapy in 
5 the clinic and drug development in the laboratory. Methods and particular uses of the 
polynucleotides of the present invention to detect Enterococcal species, including E. 
faecalis, using biosenors include those known in the art and those of: U.S. Patent Nos 
5721102, 5658732, 5631170, and World Patent Nos. WO97/35011, WO/9720203, 
each incorporated herein in their entireties. 

10 Thus, the present invention includes both bio chips and biosensors comprising 

polynucleotides of the present invention and methods of their use. 

Assaying Enterococcus polypeptide levels in a biological sample can occur 
using any art-known method, such as antibody-based techniques. For example, 
Enterococcus polypeptide expression in tissues can be studied udth classical 

15 immunohistological methods. In these, the specific recognition is provided by the 
primary antibody (polyclonal or monoclonal) but the secondary detection system can 
utilize fluorescent, enzyme, or other conjugated secondary antibodies. As a result, an 
immunohistological staining of tissue section for pattiological examination is obtained. 
Tissues can also be extracted, e.g., with urea and neutral detergent, for the liberation of 

20 Enterococcus polypeptides for Western-blot or dot/slot assay. See, e.g., Jalkanen, M. 
et al. (1985) J. Cell. Biol. 101:976-985; Jalkanen, M. et al. (1987) J. Cell . Biol. 
105:3087-3096. In this technique, which is based on the use of cationic solid phases, 
quantitation of a Enterococcus polypeptide can be accomplished using an isolated 
Enterococcus polypeptide as a standard. This technique can also be applied to body 

25 fluids. 

Other antibody-based methods useful for detecting Enterococcus polypeptide 
gene expression include immunoassays, such as the ELISA and the radioimmunoassay 
(RIA). For example, a Enterococcus polypeptide-specific monoclonal antibodies can 
be used both as an immunoabsorbent and as an enzyme-labeled probe to detect and 
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quantify a Enterococcus polypeptide. The amount of a Enterococcus polypeptide 
present in the sample can be calculated by reference to the araoimt present in a 
standard preparation using a linear regression computer algorithm. Such an ELISA is 
described in lacobelli et al. (1988) Breast Cancer Research and Treatment 1 1 :19-30. In 

5 another ELISA assay, two distinct specific monoclonal antibodies can be used to 

detect Enterococcus polypeptides in a body fluid. In this assay, one of the antibodies 
is used as the immunoabsorbent and the other as the enzyme-labeled probe. 

The above techniques may be conducted essentially as a "one-step" or 
"two-step" assay. The "one-step" assay involves contacting the Enterococcus 

10 polypeptide with immobilized antibody and, without washing, contacting the mixture 
with the labeled antibody. The "two-step" assay involves washing before contacting 
the mixture with the labeled antibody. Other conventional methods may also be 
employed as suitable. It is usually desirable to inmiobilize one component of the 
assay system on a support, thereby allowing other components of the system to be 

15 brought into contact with the component and readily removed from the sample. 
Variations of the above and other immunological methods included in the present 
invention can also be found in Harlow et al, ANTIBODIES: A LABORATORY 
MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988). 

Suitable enzyme labels include, for example, those from the oxidase group, 

20 which catalyze the production of hydrogen peroxide by reacting with substrate. 
Glucose oxidase is particularly preferred as it has good stability and its substrate 
(glucose) is readily available. Activity of an oxidase label may be assayed by 
measuring the concentration of hydrogen peroxide formed by the enzyme-labeled 
antibody/substrate reaction. Besides enzymes, other suitable labels include 

25 radioisotopes, such as iodine (*^^I, ^^^I), carbon ('"^C), sulphur (^^S), tritium (^H), 

indium (^^^In), and technetium (^^"*Tc), and fluorescent labels, such as fluorescein and 
rhodamine, and biotin. 

Further suitable labels for the Enterococcus polypeptide-specific antibodies of 
the present invention are provided below. Examples of suitable enzyme labels include 
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malate dehydrogenase, Enterococcal nuclease, delta-5-steroid isomerase, yeast-alcohol 
dehydrogenase, alpha-glycerol phosphate dehydrogenase, triose phosphate isomerase, 
peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, 
ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase, and 

5 acetylcholine esterase. 

Examples of suitable radioisotopic labels include ^H, '^^In, '^^I, ^^'l, ^^P, ^^S, 
»^C, ^'Cr, "To, ^«Co, ^^Fe, ^^Se, *"Eu, ^^Cu, ^^^Ci, ^^'At, ^^^pb, ^^Sc, '«^Pd, etc. 
* ^ *In is a preferred isotope where in vivo imaging is used since its avoids the problem 
of dehalogenation of the ^^^I or ^•'^I-labeled monoclonal antibody by the liver. In 

1 0 addition, this radionucleotide has a more favorable gamma emission energy for imaging. 
See, e.g., Perkins et al. (1985) Eur. J. NucL Med. 10:296-301; Carasquillo et al. 
(1987) J. Nucl. Med. 28:281-287. For example, "^In coupled to monoclonal 
antibodies with 1 -(P-isothiocyanatobenzyl)-DPTA has shown little uptake in 
non-tumors tissues, particularly the liver, and therefore enhances specificity of tumor 

15 localization. See, Esteban et al. (1987) J. Nucl. Med. 28:861-870, 

Examples of suitable non-radioactive isotopic labels include ^^^Gd, ^^Mn, 

Examples of suitable fluorescent labels include an ^^^Eu label, a fluorescein 
label, an isothiocyanate label, a rhodamine label, a phycoerythrin label, a phycocyanin 
20 label, an allophycocyanin label, an o-phthaldehyde label, and a fluorescamine label. 

Examples of suitable toxin labels include, Pseudomonas toxin, diphtheria toxin, 
ricin, and cholera toxin. 

Examples of chemiluminescent labels include a luminal label, an isoluminal 
label, an aromatic acridinium ester label, an imidazole label, an acridinium salt label, an 
25 oxalate ester label, a luciferin label, a luciferase label, and an aequorm label. 

Examples of nuclear magnetic resonance contrasting agents include heavy metal 
nuclei such as Gd, Mn, and iron. 

Typical techniques for binding the above-described labels to antibodies are 
provided by Kennedy et al. (1976) Clin. Chim. Acta 70: 1-3 1, and Schurs et al. (1977) 
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Clin. Chim. Acta 8 1 : 1-40. Coupling techniques mentioned in the latter are the 
glutaraldehyde method, the periodate method, the dimaleimide method, the 
m-maleimidobenzyl-N-hydroxy-succinimide ester method, all of which methods are 
incorporated by reference herein. 

5 In a related aspect, the invention includes a diagnostic kit for use in screening 

serum containing antibodies specific against E. faecalis infection. Such a kit may 
include an isolated E. faecalis antigen comprising an epitope which is specifically 
immunoreactive with at least one anti-^. faecalis antibody. Such a kit also includes 
means for detecting the binding of said antibody to the antigen. In specific 

10 embodiments, the kit may include a recombinantly produced or chemically 

synthesized peptide or polypeptide antigen. The peptide or polypeptide antigen 
may be attached to a solid support. 

In a more specific embodiment, the detecting means of the above-described kit 
includes a solid support to which said peptide or polypeptide antigen is attached. 

1 5 Such a kit may also include a non-attached reporter-labeled anti-human antibody. In 
this embodiment, binding of the antibody to the E. faecalis antigen can be detected by 
binding of the reporter labeled antibody to the anti-jE*. faecalis polypeptide antibody. 

In a related aspect, the invention includes a method of detecting E. faecalis 
infection in a subject. This detection method includes reacting a body fluid, preferably 

20 serum, from the subject with an isolated E, faecalis antigen, and examining the antigen 
for the presence of bound antibody. In a specific embodiment, the method includes a 
polypeptide antigen attached to a solid support, and serum is reacted with the 
support. Subsequently, the support is reacted with a reporter-labeled anti-himian 
antibody. The support is then examined for the presence of reporter-labeled 

25 antibody. 

The solid surface reagent employed in the above assays and kits is prepared 
by known techniques for attaching protein material to solid support material, such as 
polymeric beads, dip sticks, 96-well plates or filter material. These attachment 
methods generally include non-specific adsorption of the protein to the support or 
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covalent attachment of the protein , typically through a free amine group, to a 
chemically reactive group on the solid support, such as an activated carboxyl, 
hydroxyl, or aldehyde group. Alternatively, streptavidin coated plates can be used in 
conjunction with biotinylated antigen(s). 
5 The polypeptides and antibodies of the present invention, including fragments 

thereof, may be used to detect Enterococcal species including E.faecalis using bio chip 
and biosensor technology. Bio chip and biosensors of the present invention may 
comprise the polypeptides of the present invention to detect antibodies, which 
specifically recognize Enterococcal species, including E,faecalis, Bio chip and 

10 biosensors of the present invention may also comprise antibodies which specifically 
recognize the polypeptides of the present invention to detect Enterococcal species, 
including E.faecalis or specific polypeptides of the present invention. Bio chips or 
biosensors comprising polypeptides or antibodies of the present invention may be 
used to detect Enterococcal species, including E, faecalis, in biological and 

15 environmental samples and to diagnose an animal, including humans, with an E. 

faecalis or other Enterococcal infection. Thus, the present invention includes both bio 
chips and biosensors comprising polypeptides or antibodies of the present invention 
and methods of their use. 

The bio chips of the present invention may fiirther comprise polypeptide 

20 sequences of other pathogens including bacteria, viral, parasitic, and fungal 

polypeptide sequences, in addition to the polypeptide sequences of the present 
invention, for use in rapid diffenertial pathogenic detection and diagnosis. The bio 
chips of the present invention may further comprise antibodies or fragements thereof 
specific for other pathogens including bacteria, viral, parasitic, and ftingal polypeptide 

25 sequences, in addition to the antibodies or fragements thereof of the present invention, 
for use in rapid diffenertial pathogenic detection and diagnosis. The bio chips and 
biosensors of the present invention may also be used to monitor an E.faecalis or other 
Enterococcal infection and to monitor the genetic changes (amio acid deletions, 
insertions, substitutions, etc.) in response to drug therapy in the clinic and drug 
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development in the laboratory. The bio chip and biosensors comprising polypeptides 
or antibodies of the present invention may also be used to simultaneously monitor the 
expression of a multiplicity of polypeptides, including those of the present invention. 
The polypeptides used to comprise a bio chip or biosensor of the present invention 
5 may be specified in the same manner as for the fragements, i.e, by their N-terminal and 
C-terminal positions or length in contigious amino acid residue. Methods and 
particular uses of the polypeptides and antibodies of the present invention to detect 
Enterococcal species, including E.faecalis, or specific polypeptides using bio chip and 
biosensor technology include those known in the art, those of the U.S. Patent Nos. 
10 and World Patent Nos. listed above for bio chips and biosensors using 

polynucleotides of the present invention, and those of: U,S. Patent Nos. 5658732, 
5135852, 5567301, 5677196, 5690894 and World Patent Nos. W09729366, 
W096 12957, each incorporated herein in their entireties. 

15 Treatment: 

Agonists and Antagonists - Assays and Molecules 

The invention also provides a method of screening compoxmds to identify 
those which enhance or block the biological activity of the E.faecalis polypeptides of 
the present invention. The present invention further provides where the compounds 
20 kill or slow the grov^h of E. faecalis. The ability of E, faecalis antagonists, including 
E.faecalis ligands, to prophylactically or therapeutically block antibiotic resistance 
may be easily tested by the skilled artisan. See, e.g., Straden et al. (1997) J Bacteriol. 
179(1):9-16. 

An agonist is a compound which increases the natural biological function or 
25 which functions in a manner similar to the polypeptides of the present invention, 
while antagonists decrease or eliminate such functions. Potential antagonists include 
small organic molecules, peptides, polypeptides, and antibodies that bind to a 
polypeptide of the invention and thereby inhibit or extinguish its activity. 

The antagonists may be employed for instance to inhibit peptidoglycan cross 
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bridge formation. Antibodies against E.faecalis may be employed to bind to and 
inhibit E.faecalis activity to treat antibiotic resistance. Any of the above antagonists 
may be employed in a composition with a pharmaceutically acceptable carrier. 

5 Vaccines 

The present invention also provides vaccines comprising one or more 

polypeptides of the present invention. Heterogeneity in the composition of a vaccine 

may be provided by combining E. faecalis polypeptides of the present invention. 

Multi-component vaccines of this type are desirable because they are likely to be 
10 more effective in eliciting protective immune responses against multiple species and 

strains of the Enterococcus genus than single polypeptide vaccines. 

Multi-component vaccines are known in the art to elicit antibody production 

to numerous immunogenic components. See, e.g., Decker et al. (1996) J. Infect. Dis. 

174:8270-275. In addition, a hepatitis B, diphtheria, tetanus, pertussis tetravalent 
15 vaccine has recently been demonstrated to ehcit protective levels of antibodies in 

human infants against all four pathogenic agents. See, e.g., Aristegui, J. et al. (1997) 

Vaccine 15:7-9. 

The present invention in addition to single-component vaccines includes 
multi-component vaccines. These vaccines comprise more tiian one polypeptide, 

20 immunogen or antigen. Thus, a multi-component vaccine would be a vaccine 

comprising more than one of the E.faecalis polypeptides of the present invention. 

Further within the scope of the invention are whole cell and whole viral 
vaccines. Such vaccines may be produced recombinantly and involve the expression 
of one or more of the E. faecalis polypeptides described in Table 1 . For example, the 

25 E. faecalis polypeptides of the present invention may be either secreted or localized 
intracellular, on ihe cell surface, or in the periplasmic space. Further, when a 
recombinant virus is used, the E. faecalis polypeptides of the present invention may, 
for example, be locaHzed in the viral envelope, on the surface of the capsid, or 
internally within the capsid. Whole cells vaccines which employ cells expressing 
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heterologous proteins are known in the art. See, e.g., Robinson, K. et al. (1997) 
Nature Biotech. 15:653-657; Sirard, J. et al. (1997) Infect. Immun. 65:2029-2033; 
Chabalgoity, J. et al. (1997) Infect. Immun. 65:2402-2412 . These cells may be 
administered live or may be killed prior to administration. Chabalgoity, J. et al., supra, 
5 for example, report the successful use in mice of a live attenuated Salmonella vaccine 
strain which expresses a portion of a platyhelminth fatty acid-binding protein as a 
fusion protein on its cells surface. 

A multi-component vaccine can also be prepared using techniques known in 
the art by combining one or more E. faecalis polypeptides of the present invention, or 

10 fragments thereof, with additional non-Enterococcal components (e.g., diphtheria 
toxin or tetanus toxin, and/or other compounds known to elicit an immune response). 
Such vaccines are useful for eliciting protective immune responses to both members of 
the Enterococciis genus and non-Enterococcal pathogenic agents. 

The vaccines of the present invention also include DNA vaccines. DNA 

15 vaccines are currently being developed for a number of infectious diseases. See, et al, 
Boyer, et al. (1997) Nat. Med. 3:526-532; reviewed in Spier, R. (1996) Vaccine 
14: 1285-1288. Such DNA vaccines contain a nucleotide sequence encoding one or 
more E. faecalis polypeptides of the present invention oriented in a manner that 
allows for expression of the subject polypeptide. For example, the direct 

20 administration of plasmid DNA encoding B. burgdorgeri OspA has been shown to 
ehcit protective immunity in mice against borrelial challenge. See, Luke et al. (1997) J. 
Infect. Dis. 175:91-97. 

The present invention also relates to the administration of a vaccine which is 
co-administered with a molecule capable of modulating immune responses. Kim et al. 

25 ( 1 997) Nature Biotech. 1 5 :64 1 -646, for example, report the enhancement of immune 
responses produced by DNA immunizations when DNA sequences encoding 
molecules which stimulate the immune response are co-administered. In a similar 
fashion, the vaccines of the present invention may be co-administered with either 
nucleic acids encoding immune modxdators or the immune modulators themselves. 
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These immune modulators include granulocyte macrophage colony stimulating factor 
(GM-CSF) and CD86. 

The vaccines of the present invention may be used to confer resistance to 
Enterococcal infection by either passive or active immunization. When the vaccines of 

5 the present invention are used to confer resistance to Enterococcal infection through 
active immunization, a vaccine of the present invention is administered to an animal to 
elicit a protective immune response which either prevents or attenuates a Enterococcal 
infection. When the vaccines of the present invention are used to confer resistance to 
Enterococcal infection through passive immunization, the vaccine is provided to a host 

10 animal (e.g., hiunan, dog, or mouse), and the antisera elicited by this antisera is 

recovered and directly provided to a recipient suspected of having an infection caused 
by a member of the Enterococcus genus. 

The abihty to label antibodies, or fragments of antibodies, vnth toxin molecules 
provides an additional method for treating Enterococcal infections when passive 

15 immunization is conducted. In this embodiment, antibodies, or fragments of 

antibodies, capable of recognizing the E.faecalis polypeptides disclosed herein, or 
fragments thereof, as well as other Enterococcus proteins, are labeled with toxin 
molecules prior to their administration to the patient. When such toxin derivatized 
antibodies bind to Enterococcus cells, toxin moieties wdll be localized to these cells and 

20 will cause their death. 

The present invention thus concerns and provides a means for preventing or 
attenuating a Enterococcal infection resulting from organisms which have antigens that 
are recognized and bound by antisera produced in response to the polypeptides of the 
present invention. As used herein, a vaccine is said to prevent or attenuate a disease if 

25 its administration to an animal results either in the total or partial attenuation (/.e., 
suppression) of a symptom or condition of the disease, or in the total or partial 
inununity of the animal to the disease. 

The administration of the vaccine (or the antisera which it elicits) may be for 
either a "prophylactic" or "therapeutic" purpose. When provided prophylactically. 
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the conipound(s) are provided in advance of any symptoms of Enterococcal infection. 
The prophylactic administration of the compound(s) serves to prevent or attenuate 
any subsequent infection. When provided therapeutically, the compound(s) is 
provided upon or after the detection of symptoms which indicate that an animal may 

5 be infected with a member of the Enterococcus genus. The therapeutic administration 
of the compound(s) serves to attenuate any actual infection. Thus, the E. faecalis 
polypeptides, and fragments thereof, of the present invention may be provided either 
prior to the onset of infection (so as to prevent or attenuate an anticipated infection) 
or after the initiation of an actual infection. 

10 The polypeptides of the invention, whether encoding a portion of a native 

protein or a functional derivative thereof, may be administered in pure form or may be 
coupled to a macromolecular carrier. Example of such carriers are proteins and 
carbohydrates. Suitable proteins which may act as macromolecular carrier for 
enhancing the immunogenicity of the polypeptides of the present invention include 

15 keyhole limpet hemacyanin (KLH) tetanus toxoid, pertussis toxin, bovine serum 
albumin, and ovalbumin. Methods for coupling the polypeptides of the present 
invention to such macromolecular carriers are disclosed in Harlow et al, 
ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory 
Press, 2nd ed. 1988). 

20 A composition is said to be "pharmacologically or physiologically acceptable" 

if its administration can be tolerated by a recipient animal and is otherwise suitable for 
administration to that animal. Such an agent is said to be administered in a 
"therapeutically effective amount" if the amount administered is physiologically 
significant. An agent is physiologically significant if its presence results in a 

25 detectable change in the physiology of a recipient patient. 

While in all instances the vaccine of the present invention is administered as a 
pharmacologically acceptable compound, one skilled in the art would recognize that 
the composition of a pharmacologically acceptable compound varies with the animal 
to which it is administered. For example, a vaccine intended for human use will 



wo 98/50554 



-57- 



PCT/US98/08959 



generally not be co-administered with Freund's adjuvant. Further, the level of purity 
of the E, faecalis polypeptides of the present invention will normally be higher when 
administered to a human than when administered to a non-human animal. 

As would be understood by one of ordinary skill in the art, when the vaccine 

5 of the present invention is provided to an animal, it may be in a composition which 
may contain salts, buffers, adjuvants, or other substances which are desirable for 
improving the efficacy of the composition. Adjuvants are substances that can be used 
to specifically augment a specific unmune response. These substances generally 
perform two functions: (1) they protect the antigen(s) from being rapidly catabolized 

10 after administration and (2) they nonspecifically stimulate immune responses. 

Normally, the adjuvant and the composition are mixed prior to presentation to 
the immune system, or presented separately, but into the same site of the animal being 
immunized. Adjuvants can be loosely divided into several groups based upon their 
composition. These groups include oil adjuvants (for example, Freuntfs complete and 

15 incomplete), mineral salts (for example, A1K(S04)2, AlNa(S04)2, A1NH4(S04), silica, 
kaolin, and carbon), polynucleotides (for example, poly IC and poly AU acids), and 
certain natural substances (for example, wax D from Mycobacterium tuberculosis, as 
well as substances found in Corynebacterium parvum, or Bordetella pertussis, and 
members of the genus Brucella. Other substances useful as adjuvants are the saponins 

20 such as, for example, Quil A. (Superfos A/S, Denmark). Preferred adjuvants for use in 
the present invention include aluminum salts, such as A1K(S04)2, AlNa(S04)2, and 
A1NH4(S04). Examples of materials suitable for use in vaccine compositions are 
provided in REMINGTON'S PHARMACEUTICAL SCIENCES 1324-1341 (A. 
Osol, ed. Mack Publishing Co, Easton, PA, (1980) (incorporated herein by reference). 

25 The therapeutic compositions of the present invention can be administered 

parenterally by injection, rapid infusion, nasopharyngeal absorption 
(intranasopharangeally), dermoabsorption, or orally. The compositions may 
alternatively be administered intramuscularly, or intravenously. Compositions for 
parenteral administration include sterile aqueous or non-aqueous solutions, 
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suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, 
polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such 
as ethyl oleate. Carriers or occlusive dressings can be used to increase skin 
permeability and enhance antigen absorption. Liquid dosage forms for oral 
5 administration may generally comprise a liposome solution containing the liquid 
dosage form. Suitable forms for suspending liposomes include emulsions, suspen- 
sions, solutions, syrups, and elixirs containing inert diluents commonly used in the art, 
such as purified water. Besides the inert diluents, such compositions can also include 
adjuvants, wetting agents, emulsifying and suspending agents, or sweetening, 

1 0 flavoring, or perfuming agents. 

Therapeutic compositions of the present invention can also be administered in 
encapsulated form. For example, intranasal immunization using vaccines encapsulated 
in biodegradable microsphere composed of poly(DL-lactide-co-glycolide). See, 
Shahin, R. et al. (1995) Infect. Immun. 63:1 195-1200. Similarly, orally administered 

15 encapsulated Salmonella typhimurium antigens can also be used. Allaoui-Attarki, K. 
et al. (1997) Infect. Immun. 65:853-857. Encapsulated vaccines of the present 
invention can be administered by a variety of routes including those involving 
contacting the vaccine with mucous membranes (e.g., intranasally, intracolonicly, 
intraduodenally). 

20 Many different techniques exist for the timing of the immunizations when a 

multiple administration regimen is utilized. It is possible to use the compositions of 
the invention more than once to increase the levels and diversities of expression of the 
immunoglobulin repertoire expressed by the immunized animal. Typically, if multiple 
immunizations are given, they will be given one to two months apart. 

25 According to the present invention, an "effective amount" of a therapeutic 

composition is one which is sufficient to achieve a desired biological effect. Generally, 
the dosage needed to provide an effective amount of the composition will vary 
depending upon such factors as the animal's or human*s age, condition, sex, and extent 
of disease, if any, and other variables which can be adjusted by one of ordinary skill in 
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the art. 

The antigenic preparations of the invention can be administered by either 
single or multiple dosages of an effective amount. Effective amounts of the 
compositions of the invention can vary from 0.01-1,000 ^xg/ml per dose, more 
5 preferably 0. 1 -500 |Lig/ml per dose, and most preferably 1 0-300 ^.g/ml per dose. 

Examples 

Example 1: Isolation of a Selected DNA Clone From the Deposited Sample of E. 
faecalis 

1 0 Three approaches can be used to isolate a E. faecalis clone comprising a 

polynucleotide of the present invention from any E, faecalis genomic DNA library. 
The E. faecalis strain V586 has been deposited as a convienent source for obtaining a 
E. faecalis strain although a wide varity of strains E, faecalis strains can be used which 
are known in the art. 

15 E, faecalis genomic DNA is prepared using the followdng method. A 20ml 

ovemight bacterial culture grown in a rich medium (e.g., Ttypticase Soy Broth, Brain 
Heart Infusion broth or Super broth), pelleted, washed two times with TES (30mM 
Tris-pH 8.0, 25mM EDTA, 50mM NaCl), and resuspended in 5ml high salt TES 
(2.5M NaCl). Lysostaphin is added to final concentration of approx 50ug/ml and the 

20 mixture is rotated slowly 1 hour at 37C to make protoplast cells. The solution is then 
placed in incubator (or place in a shaking water bath) and warmed to 55C. Five 
hundred micro liter of 20% sarcosyl in TES (final concentration 2%) is then added to 
lyse the cells. Next, guanidine HCl is added to a final concentration of 7M (3.69g in 
5.5 ml). The mixture is swirled slowly at 55C for 60-90 min (solution should clear). 

25 A CsCl gradient is then set up in SW41 ultra clear tubes using 2.0ml 5.7M CsCl and 
overlaying with 2.85M CsCl. The gradient is carefully overlayed with the DNA- 
containing GuHCl solution. The gradient is spun at 30,000 rpm, 20C for 24 hr and 
the lower DNA band is collected. The volume is increased to 5 ml with TE buffer. 
The DNA is then treated with protease K (10 ug/ml) ovemight at 37 C, and 
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precipitated with ethanol. The precipitated DNA is resuspended in a desired buffer. 

In the first method, a plasmid is directly isolated by screening a plasmid E. 
faecalis genomic DNA library using a polynucleotide probe corresponding to a 
polynucleotide of the present invention. Particularly, a specific polynucleotide with 
5 30-40 nucleotides is synthesized using an Applied Biosystems DNA synthesizer 
according to the sequence reported. The oUgonucleotide is labeled, for instance, with 
■^^P-y-ATP using T4 polynucleotide kinase and purified according to routine methods. 
{See, e.g., Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Press, Cold Spring, NY (1982).) The library is transformed into a suitable 

10 host, as indicated above (such as XL-1 Blue (Stratagene)) using techniques known to 
those of skill in the art. See, e.g., Sambrook et al. MOLECULAR CLONING: A 
LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed. 1989); AusubeUt al., 
CURRENT PROTOCALS IN MOLECULAR BIOLOGY (John Wiley and Sons, 
N.Y. 1989). The transformants are plated on 1.5% agar plates (containing the 

15 appropriate selection agent, e.g., ampicillin) to a density of about 150 transformants 
(colonies) per plate. These plates are screened using Nylon membranes according to 
routine methods for bacterial colony screening. See, e.g., Sambrook et al. 
MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor, 
N.Y. 2nd ed. 1989); Ausubel et al., CURRENT PROTOCALS IN MOLECULAR 

20 BIOLOGY (John Wiley and Sons, N.Y. 1989) or other techniques known to those of 
skill in the art. 

Alternatively, two primers of 15-25 nucleotides derived from the 5' and 3* ends 
of a polynucleotide of Table 1 are synthesized and used to amplify the desired DNA 
by PCR using a £. faecalis genomic DNA prep as a template. PCR is carried out 
25 under routine conditions, for instance, in 25 \\\ of reaction mixture with 0.5 ug of the 
above DNA template. A convenient reaction mixture is 1.5-5 mM MgCl2, 0.01% 
(w/v) gelatin, 20 \M each of dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 
0.25 Unit of Taq polymerase. Thirty five cycles of PCR (denaturation at 94°C for 1 
min; araieahng at 55°C for 1 min; elongation at 72°C for 1 min) are performed with a 
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Perkin-Elmer Cetus automated thermal cycler. The ampHfied product is analyzed by 
agarose gel electrophoresis and the DNA band with expected molecular weight is 
excised and purified. The PCR product is verified to be the selected sequence by 
subcloning and sequencing the DNA product. 
5 Finally, overlapping oligos of the DNA sequences of Table 1 can be chemically 

synthesized and used to generate a nucleotide sequence of desired length using PCR 
methods known in the art. 

Example 2(a): Expression and Purification Enterococcal polypeptides in E. coli 

10 The bacterial expression vector pQE60 was used for bacterial expression of 

some of the polypeptide fragements used in the soft tissue and systemic infection 
models discussed below. (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 
91311). pQE60 encodes ampiciUin antibiotic resistance ("Ampr") and contains a 
bacterial origin of replication ("ori"), an IPTG inducible promoter, a ribosome binding 

1 5 site ("RBS"), six codons encoding histidine residues that allow affinity purification 
using nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin (QIAGEN, Inc., supra) 
and suitable single restriction enzyme cleavage sites. These elements are arranged such 
that an inserted DNA fragment encoding a polypeptide expresses that polypeptide 
with the six His residues (i.e., a "6 X His tag") covalently linked to the carboxyl 

20 terminus of that polypeptide. 

The DNA sequence encoding the desired portion of a E, faecalis protein of the 
present invention was amplified fi*om E,faecalis genomic DNA using PCR 
oligonucleotide primers which anneal to the 5' and 3' sequences coding for the 
portions of the E,faecalis polynucleotide shown in Table 1. Additional nucleotides 

25 containing restriction sites to facilitate cloning in the pQE60 vector are added to the 5* 
and 3* sequences, respectively. 

For cloning the mature protein, the 5' primer has a sequence containing an 
appropriate restriction site followed by nucleotides of the amino terminal coding 
sequence of the desired E.faecalis polynucleotide sequence in Table 1. One of 
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ordinary skill in the art would appreciate that the point in the protein coding sequence 
where the 5' and 3' primers begin may be varied to amplify a DNA segment encoding 
any desired portion of the complete protein shorter or longer than the mature form. 
The 3' primer has a sequence containing an appropriate restriction site followed by 
5 nucleotides complementary to the 3* end of the polypeptide coding sequence of Table 
1, excluding a stop codon, with the coding sequence aligned with the restriction site so 
as to maintain its reading frame with that of the six His codons in the pQE60 vector. 

The amplified E, faecalis DNA fragment and the vector pQE60 were digested 
v^ath restriction enzymes which recognize the sites in the primers and the digested 

1 0 DNAs were then ligated together. The £. faecalis DNA was inserted into the 

restricted pQE60 vector in a manner which places the E, faecalis protein coding region 
downstream from the IPTG-inducible promoter and in- frame with an initiating AUG 
and the six histidine codons. 

The ligation mixture was transformed into competent E, colt cells using 

15 standard procedures such as those described by Sambrook et al., supra.. E. coli strain 
M15/rep4, containing multiple copies of the plasmid pREP4, which expresses the lac 
repressor and confers kanamycin resistance ("Kanr"), was used in carrying out ttie 
illustrative example described herein. This strain, which was only one of many that 
are suitable for expressing a E. faecalis polypeptide, is available conunercially 

20 (QIAGEN, Inc., supra). Transformants were identified by their ability to grow on LB 
agar plates in the presence of ampicillin and kanamycin. Plasmid DNA was isolated 
from resistant colonies and the identity of the cloned DNA confirmed by restriction 
analysis, PGR and DNA sequencing. 

Clones containing the desired constructs were grown overnight ("0/N") in 

25 liquid culture in LB media supplemented with both ampicillin (1 00 flg/ml) and 
kanamycin (25 ^ig/ml). The 0/N culture was used to inoculate a large culture, at a 
dilution of approximately 1:25 to 1:250. The cells were grown to an optical density at 
600 rmi ("OD600") of between 0.4 and 0.6. Isopropyl-P-D-thiogalactopyranoside 
("IPTG") was then added to a final concentration of 1 mM to induce transcription 
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from the lac repressor sensitive promoter, by inactivating the lad repressor. Cells 
subsequently were incubated further for 3 to 4 hours. Cells then were harvested by 
centrifugation. 

The cells were then stirred for 3-4 hours at 4°C in 6M guanidine-HCl, pH 8. 
5 The cell debris was removed by centrifugation, and the supernatant containing the E. 
faecalis polypeptide was loaded onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") 
affinity resin column (QIAGEN, Inc., supra). Proteins with a 6 x His tag bind to the 
Ni-NTA resin with high affinity were purified in a simple one-step procedure (for 
details see: The QIAexpressionist, 1995, QIAGEN, Inc., supra). Briefly the 
10 supernatant was loaded onto the column in 6 M guanidine-HCl, pH 8, the column was 
first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed with 10 
volumes of 6 M guanidine-HCl pH 6, and finally the E. faecalis polypeptide was 
eluted with 6 M guanidine-HCl, pH 5. 

The purified protein was then renatured by dialyzing it against 
15 phosphate-buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM 
NaCL Altematively, the protein could be successfiiUy refolded while immobilized on 
the Ni-NTA column. The recommended conditions are as follows: renature using a 
linear 6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, 
containing protease inhibitors. The renaturation should be performed over a period of 
20 1 .5 hours or more. After renaturation the proteins can be eluted by the addition of 
250 mM immidazole, Immidazole was removed by a final dialyzing step against PBS 
or 50 mM sodium acetate pH 6 buffer plus 200 mM NaCl. The purified protein was 
stored at 4° C or frozen at -80° C, 

Some of the polypeptide of the present invention were prepared using a non- 
25 denaturing protein purification method. For these polypeptides, the cell pellet from 
each liter of culture was resuspended in 25 mis of Lysis Buffer A at 4°C (Lysis Buffer 
A = 50 mM Na-phosphate, 300 mM NaCl, 10 mM 2-mercaptoethanol, 10% 
Glycerol, pH 7.5 with 1 tablet of Complete EDTA-free protease inhibitor cocktail 
(Boehringer Mannheim #1873580) per 50 ml of buffer). Absorbance at 550 nm was 
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approximately 10-20 O.D./ml. The suspension was then put through three 
freeze/thaw cycles from -70^C (using a ethanol-dry ice bath) up to room temperature. 
The cells were lysed via sonication in short 10 sec bursts over 3 minutes at 
approximately SOW while kept on ice. The sonicated sample was then centrifuged at 
5 15,000 RPM for 30 minutes at 4°C. The supernatant was passed through a column 
containing 1 .0 ml of CL-4B resin to pre-clear the sample of any proteins that may 
bind to agarose non-specifically, and the flow-through fraction was collected. 

The pre-cleared flow-through was apphed to a nickel-nitrilo-tri-acetic acid 
('*Ni-NTA") affinity resin column (Quiagen, Inc., supra). Proteins with a 6 X His tag 

1 0 bind to the Ni-NTA resin with high affinity and can be purified in a simple one-step 
procedure. Briefly, the supernatant was loaded onto the column in Lysis Buffer A al 
4**C, the column was first washed with 10 volumes of Lysis Buffer A until the A280 
of the eluate returns to the baseline. Then, the column was washed with 5 volumes of 
40 mM Imidazole (92% Lysis Buffer A / 8% Buffer B) (Buffer B = 50 mM Na- 

15 Phosphate, 300 mM NaCl, 10% Glycerol, 10 mM 2-mercaptoethanol, 500 mM 
Imidazole, pH of the final buffer should be 7.5). The protein was eluted off of the 
column with a series of increasing Imidazole solutions made by adjusting the ratios of 
Lysis Buffer A to Buffer B. Three different concentrations were used: 3 volumes of 
75 mM Imidazole, 3 volumes of 150 mM Imidazole, 5 volumes of 500 mM 

20 Imidazole. The fractions containing the purified protein were analyzed using 8 %, 10 
% or 14% SDS-PAGE depending on the protein size. The purified protein was then 
dialyzed 2X against phosphate-buffered saline (PBS) in order to place it into an easily 
workable buffer. The purified protein was stored at 4** C or frozen at -80°. 

The following alternative method may be used to purify £. faecalis expressed 

25 in E coll when it is present in the form of inclusion bodies. Unless otherwise 
specified, all of the following steps are conducted at 4-10°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-1 0°C and the cells are harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per 
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unit weight of cell paste and the amount of purified protein required, an appropriate 
amount of cell paste, by weight, is suspended in a buffer solution containing 100 mM 
Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension 
using a high shear mixer. 
5 The cells are then lysed by passing the solution through a microfluidizer 

(Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate 
is then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 x g for 1 5 min. The resultant pellet is washed again using 0.5M 
NaCl, 1 00 mM Tris, 50 mM EDTA, pH 7.4. 

10 The resulting washed inclusion bodies are solubilized with 1 .5 M guanidine 

hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the 
pellet is discarded and the E.faecalis polypeptide-containing supernatant is incubated 
at 4°C overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, 

15 the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 
20 volumes of buffer containing 50 mM sodium, pH 4.5, 1 50 mM NaCl, 2 mM 
EDTA by vigorous stirring. The refolded diluted protein solution is kept at 4°C 
without mixing for 12 hours prior to further purification steps. 

To clarify the refolded E. faecalis polypeptide solution, a previously prepared 

20 tangential filtration unit equipped with 0.16 (xm membrane filter with appropriate 
surface area (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6,0 is 
employed. The filtered sample is loaded onto a cation exchange resin (e.g., Poros HS- 
50, Perseptive Biosystems), The column is washed with 40 mM sodium acetate, pH 
6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same 

25 buffer, in a stepwise maimer. The absorbance at 280 mm of the effluent is 

continuously monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the E.faecalis polypeptide are then pooled and mixed 
with 4 volumes of water. The diluted sample is then loaded onto a previously 
prepared set of tandem columns of strong anion (Poros HQ-50, Perseptive 



wo 98/50554 



-66- 



PCT/US98/089S9 



Biosystems) and weak anion (Poros CM-20, Perseptive Biosystems) exchange resins. 
The columns are equihbrated with 40 mM sodium acetate, pH 6.0. Both columns are 
washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 column is 
then eluted using a 10 column volume linear gradient ranging from 0,2 M NaCl, 50 
5 mM sodium acetate, pH 6.0 to 1 .0 M NaCl, 50 mM sodium acetate, pH 6.5. 
Fractions are collected under constant A280 monitoring of the effluent. Fractions 
containing the E,faecalis polypeptide (determined, for instance, by 16% SDS-PAGE) 
are then pooled. 

The resultant E. faecalis polypeptide exhibits greater than 95% purity after 
10 the above refolding and purification steps. No major contaminant bands are observed 
from Commassie blue stained 16% SDS-PAGE gel when 5 fxg of purified protein is 
loaded. The purified protein is also tested for endotoxin/LPS contamination, and 
typically the LPS content is less than 0. 1 ng/ml according to LAL assays. 

1 5 Example 2(b): Alternative Expression and Purification Enterococcal polypeptides in E. 
cqli 

Tthe vector pQElO was alternatively used to clone and express some of the 
polypeptides of the present invention for use in the soft tissue and systemic infection 
models discussed below. The difference being such that an inserted DNA fragment 

20 encoding a polypeptide expresses that polypeptide with the six His residues (i.e., a "6 
X His tag") covalently linked to the amino terminus of that polypeptide. The bacterial 
expression vector pQElO (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 
91311) was used in this example . The components of the pQElO plasmid are 
arranged such that the inserted DNA sequence encoding a polypeptide of the present 

25 invention expresses the polypeptide with the six His residues (/.e., a "6 X His tag")) 
covalently linked to the amino terminus. 

The DNA sequences encoding the desired portions of a polypeptide of Table 
1 were amplified using PCR oligonucleotide primers from genomic E. faecalis DNA. 
The PCR primers anneal to the nucleotide sequences encoding the desired amino acid 
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sequence of a polypeptide of the present invention. Additional nucleotides containing 
restriction sites to facilitate cloning in the pQElO vector were added to the 5' and 3* 
primer sequences, respectively. 

For cloning a polypeptide of the present invention, the 5' and 3* primers were 
5 selected to amplify their respective nucleotide coding sequences. One of ordinary skill 
in the art would appreciate that the point in the protein coding sequence where the 5' 
and 3' primers begins may be varied to amplify a DNA segment encoding any desired 
portion of a polypeptide of the present invention. The 5' primer was designed so the 
coding sequence of the 6 X His tag is aligned with the restriction site so as to maintain 
10 its reading frame with that of E.faecalis polypeptide. The 3* was designed to include 
an stop codon. The amplified DNA fragment was then cloned, and the protein 
expressed, as described above for the pQE60 plasmid. 

The DNA sequences encoding the amino acid sequences of Table 1 may also 
be cloned and expressed as fusion proteins by a protocol similar to that described 
15 directly above, wherein the pET-32b(+) vector (Novagen, 601 Science Drive, 
Madison, WI 5371 1) is preferentially used in place of pQElO. 

The above methods are not limited to the polypeptide fragements actually 
produced. The above method, like the methods below, can be used to produce either 
full length polypeptides or desired fragements therof. 

20 

Example 2(c): Alternative Expression and Purification of Enterococcal polypeptides 
in E, coli 

The bacterial expression vector pQE60 is used for bacterial expression in this 
example (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 9131 1), However, in 
25 this example, the polypeptide coding sequence is inserted such that translation of the 
six His codons is prevented and, therefore, the polypeptide is produced with no 6 X 
His tag. 

The DNA sequence encoding the desired portion of the E.faecalis amino acid 
sequence is amplified from an E.faecalis genomic DNA prep the deposited DNA 



wo 98/50554 



-68- 



PCT/US98/089S9 



clones using PGR oligonucleotide primers which anneal to the 5' and 3' nucleotide 
sequences corresponding to the desired portion of the E, faecalis polypeptides. 
Additional nucleotides containing restriction sites to facilitate cloning in the pQE60 
vector are added to the 5' and 3* primer sequences. 

5 For cloning a E. faecalis polypeptides of the present invention, 5' and 3' 

primers are selected to amplify their respective nucleotide coding sequences. One of 
ordinary skill in the art would appreciate that the point in the protein coding sequence 
where the 5* and 3' primers begin may be varied to amplify a DNA segment encoding 
any desired portion of a polypeptide of the present invention. The 3' and 5' primers 

10 contain appropriate restriction sites followed by nucleotides complementary to the 5' 
and 3' ends of the coding sequence respectively. The 3' primer is additionally designed 
to include an in-frame stop codon. 

The amplified E. faecalis DNA fragments and the vector pQE60 are digested 
with restriction enzymes recognizing the sites in the primers and the digested DNAs 

1 5 are then ligated together. Insertion of the E, faecalis DNA into the restricted pQE60 
vector places the E. faecalis protein coding region including its associated stop codon 
downstream from the IPTG-inducible promoter and in-frame with an initiating AUG. 
The associated stop codon prevents translation of the six histidine codons 
downstream of the insertion point. 

20 The ligation mixture is transformed into competent E. coli cells using standard 

procedures such as those described by Sambrook et al. £. coli strain M15/rep4, 
containing multiple copies of the plasmid pREP4, which expresses the lac repressor 
and confers kanamycin resistance ("Kanr"), is used in carrying out the illustrative 
example described herein. This strain, which is only one of many that are suitable for 

25 expressing E, faecalis polypeptide, is available commercially (QIAGEN, Inc., supra). 
Transformants are identified by their ability to grow on LB plates in the presence of 
ampicillin and kanamycin. Plasmid DNA is isolated from resistant colonies and the 
identity of the cloned DNA confirmed by restriction analysis, PGR and DNA 
sequencing. 
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Clones containing the desired constructs are grown overnight ("0/N") in liquid 
culture in LB media supplemented with both ampicillin (100 flg/ml) and kanamycin 
(25 M.g/ml). The 0/N culture is used to inoculate a large culture, at a dilution of 
approximately 1 :25 to 1 :250. The cells are grown to an optical density at 600 nm 
5 ("OD600") of between 0.4 and 0.6. isopropyl-b-D-thiogalactopyranoside ("IPTG") is 
then added to a final concentration of 1 mM to induce transcription from the lac 
repressor sensitive promoter, by inactivating the lacl repressor. Cells subsequently 
are incubated further for 3 to 4 hours. Cells then are harvested by centrifugation. 

To purify the E. faecalis polypeptide, the cells are then stirred for 3-4 hours at 
10 4°C in 6M guanidine-HCl, pH 8. The cell debris is removed by centrifugation, and the 
supernatant containing the E, faecalis polypeptide is dialyzed against 50 mM Na- 
acetate buffer pH 6, supplemented with 200 mM NaCl. Alternatively, the protein 
can be successfully refolded by dialyzing it against 500 mM NaCl, 20% glycerol, 25 
mM Tris/HCl pH 7.4, containing protease inhibitors. After renaturation the protein 
1 5 can be purified by ion exchange, hydrophobic interaction and size exclusion 

chromatography. Alternatively, an affinity chromatography step such as an antibody 
column can be used to obtain pure E, faecalis polypeptide. The purified protein is 
stored at 4"* C or frozen at -80'' C. 

The following alternative method may be used to purify E. faecalis 
20 polypeptides expressed in E coli when it is present in the form of inclusion bodies. 
Unless otherwise specified, all of the following steps are conducted at 4-10°C. 

Upon completion of the production phase of the E, coli fermentation, the cell 
culture is cooled to 4-1 O^C and the cells are harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per 
25 unit weight of cell paste and the amount of purified protein required, an appropriate 
amount of cell paste, by weight, is suspended in a buffer solution containing 1 00 mM 
Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension 
using a high shear mixer. 

The cells ware then lysed by passing the solution through a microfluidizer 
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(Microfiiidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate 
is then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 x g for 15 min. The resultant pellet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4, 
5 The resulting washed inclusion bodies are solubilized with 1 .5 M guaaidine 

hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the 
pellet is discarded and the E.faecalis polypeptide-containing supernatant is incubated 
at 4°C overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, 

10 the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 
20 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM 
EDTA by vigorous stirring. The refolded diluted protein solution is kept at 4°C 
without mixing for 12 hours prior to further purification steps. 

To clarify the refolded E, faecalis polypeptide solution, a previously prepared 

15 tangential filtration unit equipped with 0. 1 6 |im membrane filter with appropriate 
surface area (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is 
employed. The filtered sample is loaded onto a cation exchange resin (e.g., Poros HS- 
50, Perseptive Biosystems). The column is washed with 40 mM sodium acetate, pH 
6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same 

20 buffer, in a stepwise manner. The absorbance at 280 mm of the effluent is 

continuously monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the E, faecalis polypeptide are then pooled and mixed 
with 4 volumes of water. The diluted sample is then loaded onto a previously 
prepared set of tandem columns of strong anion (Poros HQ-50, Perseptive 

25 Biosystems) and weak anion (Poros CM-20, Perseptive Biosystems) exchange resins. 
The columns are equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are 
washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 colunm is 
then eluted using a 1 0 column volume linear gradient ranging from 0.2 M NaCl, 50 
mM sodium acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. 



wo 98^0554 



-71- 



PCT/US98/08959 



Fractions are collected under constant A2go monitoring of the effluent. Fractions 
containing the E.faecalis polypeptide (determined, for instance, by 16% SDS-PAGE) 
are then pooled. 

The resultant E, faecalis polypeptide exhibits greater than 95% purity after 
5 the above refolding and purification steps. No major contaminant bands are observed 
from Commassie blue stained 16% SDS-PAGE gel when 5 |ag of purified protein is 
loaded. The purified protein is also tested for endotoxin/LPS contamination, and 
typically the LPS content is less than 0. 1 ng/ml according to LAL assays. 

1 0 Example 2( d) : Cloning and Expression of E. faecalis in Other Bacteria 

E.faecalis polypeptides can also be produced in: E.faecalis using the methods 
of S. Skinner et al., (1988) Mol. Microbiol. 2:289-297 or J. I. Moreno (1996) Protein 
Expr. Purif 8(3):332-340; Lactobacillus using the methods of C. Rush et al., 1997 
Appl. Microbiol. Biotechnol. 47(5):537-542; or in Bacillus subtilis using the methods 

15 Chang et al., U.S. Patent No. 4,952,508. 

Example 3: Cloning and Expression in COS Cells 

A E, faecalis expression plasmid is made by cloning a portion of the DNA 
encoding a E.faecalis polypeptide into the expression vector pDNAI/Amp or 

20 pDNAIII (which can be obtained fix)m Invitrogen, Inc.). The expression vector 

pDNAI/amp contains: (1) an E. coli origin of replication effective for propagation in 
E, coli and other prokaryotic cells; (2) an ampicillin resistance gene for selection of 
plasmid-containing prokaryotic cells; (3) an SV40 origin of replication for propagation 
in eukaryotic cells; (4) a CMV promoter, a polylinker, an SV40 intron; (5) several 

25 codons encoding a hemagglutinin fiagment (i.e., an "HA" tag to facilitate purification) 
followed by a termination codon and polyadenylation signal arranged so that a DNA 
can be conveniently placed under expression control of the CMV promoter and 
operably linked to the SV40 intron and the polyadenylation signal by means of 
restriction sites in the polylinker. The HA tag corresponds to an epitope derived 
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from the influenza hemagglutinin protein described by Wilson et al. 1984 Cell 37:167. 
The fusion of the HA tag to the target protein allows easy detection and recovery of 
the recombinant protein with an antibody that recognizes the HA epitope. pDNAIII 
contains, in addition, the selectable neomycin marker. 
5 A DNA fragment encoding a E.faecalis polypeptide.is cloned into the 

poly linker region of the vector so that recombinant protein expression is directed by 
the CMV promoter. The plasmid construction strategy is as follows. The DNA from 
a E,faecalis genomic DNA prep is amplified using primers that contain convenient 
restriction sites, much as described above for construction of vectors for expression of 

10 E, faecalis in E. coli. The 5' primer contains a Kozak sequence, an AUG start codon, 
and nucleotides of the 5' coding region of the E, faecalis polypeptide. The 3' primer, 
contains nucleotides complementary to the 3' coding sequence of the E, faecalis DNA, 
a stop codon, and a convenient restriction site. 

The PGR amplified DNA fragment and the vector, pDNAI/Amp, are digested 

15 with appropriate restriction enzymes and then ligated. The ligation mixture is 
transformed into an appropriate E. coli strain such as SURE"^"^ (Stratagene Gloning 
Systems, La Jolla, CA 92037), and the transformed culture is plated on ampicillin 
media plates which then are incubated to allow growth of ampicillin resistant colonies. 
Plasmid DNA is isolated from resistant colonies and examined by restriction analysis 

20 or other means for the presence of the fragment encoding the ^.yaeca/w polypeptide 
For expression of a recombinant E, faecalis polypeptide, GOS cells are 
transfected with an expression vector, as described above, using DEAE-dextran, as 
described, for instance, by Sambrook et al. (supra), Gells are incubated under 
conditions for expression of E. faecalis by the vector. 

25 Expression of the E, faecalis-HA fusion protein is detected by radiolabeling 

and immunoprecipitation, using methods described in, for example Harlow et al., 
supra,. To this end, two days after transfection, the cells are labeled by incubation in 
media containing ^^S-cysteine for 8 hours. The cells and the media are collected, and 
the cells are washed and the lysed with detergent-containing RIPA buffer: 1 50 mM 
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NaCl, 1% NP-40, 0.1% SDS, 1% NP-40, 0.5% DOC, 50 mM TRIS, pH 7.5, as 
described by Wilson et al. {supra ). Proteins are precipitated from the cell lysate and 
from the culture media using an HA-specific monoclonal antibody. The precipitated 
proteins then are analyzed by SDS-PAGE and autoradiography. An expression 
5 product of the expected size is seen in the cell lysate, which is not seen in negative 
controls. 

Example 4: Cloning and Expression in CHO Cells 

The vector pC4 is used for the expression of E. faecalis polypeptide in this 

10 example. Plasmid pC4 is a derivative of the plasmid pSV2-dhfr (ATCC Accession 
No. 37146). The plasmid contains the mouse DHFR gene under control of the SV40 
early promoter. Chinese hamster ovary cells or other cells lacking dihydrofolate 
activity that are transfected with these plasmids can be selected by growing the cells 
in a selective medium (alpha minus MEM, Life Technologies) supplemented with the 

1 5 chemotherapeutic agent methotrexate. The amplification of the DHFR genes in cells 
resistant to methotrexate (MTX) has been well documented. See, e.g., Alt et al., 
1978, J. Biol. Chem. 253:1357-1370; Hamlin et al., 1990, Biochem, et Biophys. Acta, 
1097:107-143; Page etal, 1991, Biotechnology 9:64-68, Cells grown in increasing 
concentrations of MTX develop resistance to the dmg by overproducing the target 

20 enzyme, DHFR, as a result of amplification of the DHFR gene. If a second gene is 
linked to the DHFR gene, it is usually co-amplified and over-expressed. It is knovra 
in the art that this approach may be used to develop cell lines carrying more than 
1 ,000 copies of the amplified gene(s). Subsequently, when the methotrexate is 
withdrawn, cell lines are obtained which contain the amplified gene integrated into one 

25 or more chromosome(s) of the host cell. 

Plasmid pC4 contains the strong promoter of the long terminal repeat (LTR) 
of the Rouse Sarcoma Virus, for expressing a polypeptide of interest, Cullen, et al. 
(1985) Mol. Cell. Biol. 5:438-447; plus a fragment isolated from the enhancer of the 
immediate early gene of himian cytomegalovirus (CMV), Boshart, et al., 1985, Cell 
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41 :521-530. Downstream of the promoter are the following single restriction enzyme 
cleavage sites that allow the integration of the genes: Bam HI, Xba I, and Asp 718. 
Behind these cloning sites the plasmid contains the 3* intron and polyadenylation site 
of the rat preproinsulin gene. Other high efficiency promoters can also be used for the 

5 expression, e.g., the human JJ-actin promoter, the SV40 early or late promoters or the 
long terminal repeats from other retroviruses, e.g., HIV and HTLVI. Clontech's Tet- 
Off and Tet-On gene expression systems and similar systems can be used to express 
the E. faecalis polypeptide in a regulated way in mammalian cells (Gossen et al., 1992, 
Proc. Natl. Acad. Sci. USA 89:5547-5551. For the polyadenylation of the mRNA 

1 0 other signals, e.g., from the human growth hormone or globin genes can be used as 
well. Stable cell lines carrying a gene of interest integrated into the chromosomes can 
also be selected upon co-transfection with a selectable marker such as gpt, G418 or 
hygromycin. It is advantageous to use more than one selectable marker in the 
beginning, e.g., 04 1 8 plus methotrexate. 

15 The plasmid pC4 is digested with the restriction enzymes and then 

dephosphorylated using calf intestinal phosphates by procedures known in the art. 
The vector is then isolated from a 1% agarose gel. The DNA sequence encoding the E. 
faecalis polypeptide is amplified using PGR oligonucleotide primers corresponding to 
the 5' and 3' sequences of the desired portion of the gene. A 5* primer containing a 

20 restriction site, a Kozak sequence, an AUG start codon, and nucleotides of the 5' 
coding region of the E, faecalis polypeptide is synthesized and used. A 3' primer, 
containing a restriction site, stop codon, and nucleotides complementary to the 3* 
coding sequence of the E. faecalis polypeptides is synthesized and used. The 
amplified fragment is digested wdth the restriction endonucleases and then purified 

25 again on a 1% agarose gel. The isolated fragment and the dephosphorylated vector are 
then hgated with T4 DNA ligase. £. coli HBlOl or XL-1 Blue cells are then 
transformed and bacteria are identified that contain the fragment inserted into plasmid 
pC4 using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene are used for 
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transfection. Five |ig of the expression plasmid pC4 is cotransfected with 0.5 \ig of 
the plasmid pSVneo using a Upid-mediated transfection agent such as Lipofectin"^'^ or 
LipofectAMINE (LifeTechnologies Gaithersburg, MD). The plasmid pSV2-neo 
contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme 

5 that confers resistance to a group of antibiotics including G418, The cells are seeded 
in alpha minus MEM supplemented with 1 mg/ml 0418. After 2 days, the cells are 
trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha 
minus MEM supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml 
G418. After about 10-14 days single clones are trypsinized and then seeded in 6-well 

10 petri dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 

100 nM, 200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of 
methotrexate are then transferred to new 6-well plates containing even higher 
concentrations of methotrexate (1 ^iM, 2 |iM, 5 |LtM, 10 mM, 20 mM). The same 
procedure is repeated until clones are obtained which grow at a concentration of 

15 100-200 ^iM. Expression of the desired gene product is analyzed, for instance, by 
SDS-PAGE and Western blot or by reversed phase HPLC analysis. 

Example 5: Quantitative Murine Soft Tissue Infection Model for E. faecalis 

Compositions of the present invention, including polypeptides and peptides, 

20 are assayed for their ability to function as vaccines or to enhance/stimulate an immune 
response to a bacterial species (e.g., E, faecalis) using the following quantitative 
murine soft tissue infection model. Mice (e.g., NIH Swiss female mice, approximately 
7 weeks old) are first treated with a biologically protective effective amount, or 
immune enhancing/stimulating effective amount of a composition of the present 

25 invention using methods known in the art, such as those discussed above. See,e,g., 
Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor 
Laboratory Press, 2nd ed. 1988). An example of an appropriate starting dose is 20ug 
per animal. 



J 
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The desired bacterial species used to challenge the mice, such as E.faecalis, is 
grown as an overnight culture. The culture is diluted to a concentration of 5 X 10^ 
cfu/ml, in an appropriate media, mixed well, serially diluted, and titered. The desired 
doses are further diliuted 1 :2 with sterilized Cytodex 3 microcarrier beads preswoUen 
5 in sterile PBS (3g/100ml). Mice are anesthetize briefly imtil docile, but still mobile 
and injected with 0.2 ml of the Cytodex 3 bead/bacterial mixture into each animal 
subcutaneously in the inguinal region. After four days, counting the day of injection 
as day one, mice are sacrificed and the contents of the abscess is excised and placed in 
a 15 ml conical tube containing 1.0ml of sterile PBS. The contents of the abscess is 

10 then enzymatically treated and plated as follows. 

The abscess is first disrupted by vortexing with sterilized glass beads placed in 
the tubes. 3.0mls of prepared enzyme mixture (1.0ml CoUagenase D (4.0 mg/ml), 
1 .0ml Trypsin (6.0 mg/ml) and 8.0 mis PBS) is then added to each tube followed by a 
20 min. incubation at 37C. The solution is then centrifuged and the supernatant 

15 drawn off. 0.5 ml dH20 is then added and the tubes are vortexed and then incubated 
for 10 min. at room temperature. 0.5 ml media is then added and samples are serially 
diluted and plated onto agar plates, and grown overnight at 37C. Plates with distinct 
and separate colonies are then counted, compared to positive and negative control 
samples, and quantified. The method can be used to identify composition and 

20 determine appropriate and effective doses for humans and other animals by comparing 
the effective doses of compositions of the present invention with compositions 
known in the art to be effective in both mice and humans. Doses for the effective 
treatment of humans and other animals, using compositions of the present invention, 
are extrapolated using the data from the above experiments of mice. It is appreciated 

25 that further studies in humans and other animals may be needed to determine the most 
effective doses using methods of clinical practice known in the art. 

Example 6: Murine Systemic Neutropenic Model for E. faecalis Infection 
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Compositions of the present invention, including polypeptides and peptides, 
are assayed for their ability to function as vaccines or to enhance/stimulate an immune 
response to a bacterial species (e.g., E.faecalis) using the foUoAving qualitative murine 
systemic neutropenic model. Mice (e.g., NIH Sv^ss female mice, approximately 7 

5 weeks old) are first treated with a biologically protective effective amount, or immune 
enhancing/stimulating effective amount of a composition of the present invention 
using methods known in the art, such as those discussed above. See,e.g,, Harlow et 
al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory 
Press, 2nd ed. 1 988). An example of an appropriate starting dose is 20ug per animal. 

10 Mice are then injected with 250 - 300 mg/kg cyclophosphamide intraperitonially. 

Counting the day of CP. injection as day one, the mice are left imtreated for 5 days to 
begin recovery of PMNUS. 

The desired bacterial species used to challenge the mice, such as E.faecalis, is 
grown as an overnight culture. The culture is diluted to a concentration of 5 X 10^ 

15 cfu/ml, in an appropriate media, mixed well, serially diluted, and titered. The desired 
doses are further diliuted 1 :2 in 4% Brewer's yeast in media. 
Mice are injected with the bacteria/brewer's yeast challenge intraperitonially. The 
Brewer's yeast solution alone is used as a control. The mice are then monitered twice 
daily for the first week following challenge, and once a day for the next week to 

20 ascertain morbidity and mortality. Mice remaining at the end of the experiment are 
sacrificed. The method can be used to identify compositions and determine 
appropriate and effective doses for humans and other animals by comparing the 
effective doses of compositions of the present invention with compositions known in 
the art to be effective in both mice and humans. Doses for the effective treatment of 

25 humans and other animals, using compositions of the present invention, are 

extrapolated using the data from the above experiments of mice. It is appreciated that 
further studies in humans and other animals may be needed to determine the most 
effective doses using methods of clinical practice known in the art. 
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The disclosure of all publications (including patents, patent applications, 
journal articles, laboratory manuals, books, or other documents) cited herein are 
hereby incorporated by reference in their entireties. 

The present invention is not to be limited in scope by the specific 
5 embodiments described herein, which are intended as single illustrations of individual 
aspects of the invention. Functionally equivalent methods and components are within 
the scope of the invention, in addition to those shown and described herein and will 
become apparant to those skilled in the art from the foregoing description and 
accompanying drawings. Such modifications are intended to fall within the scope of 
10 the appended claims. 



wo 98/50554 



PCT/US98/08959 



79 

TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes. 
EFOOl-l (SEQ ID N0:1) 

TGAAAGAATA TTGCCAGAAC GTGGCGAGCA AATTGTTTTA TAAATTTTTT TAAGGGAGAG 
hAAAAAATGA AGTTCAAAAC TCTAGCAACA ACAGTGTTAG CAACCGCAGC TATTTTCGCA 
TTGGGGGCTT GTGGTAACGG TAATGGGGCC AAAGAATCAA ACGATATTGT GAAAGAAGTG 
AAGGAAGATA CGACAATCAC TTTCTGGCAT GCAATGAATG GGGTTCAAGA AGAAGCGTTA 
ACAAAATTAA CGAAAGACTT CATGAAAGAA AATCCAAAAA TTAAAGTGGA ATTACAAAAT 
CAATCTGCTT ACCCTGATTT ACAAGCCAAA ATCAATTCGA CTTTAACTTC ACCAAAAGAT 
TTACCAACAA TTACGCAAGC GTACCCAGGC TGGTTATGGA ATGCTGCACA AGATGAAATG 
TTAGTGGACT TAAAACCATA TATGGATGAT GACACAATCG GCTGGAAAGA TGCAGAGCCA 
ATTCGTGAAG TATTGTTAGA CGGCGCCAAA ATCGACGGCA AACAATACGG CATTCCATTT 
AATAAATCGA CAGAAaTGTT ATTCTATAAT GCTGATTTGT TGAAAGAATA TGGTGTTGAA 
GTACCGAAAA CATTAGAGGA ATTAAAAGAA GCTTCTAAAA CAATTTACGA AAAATCCAAC 
AAAGAAGTCG TTGGTGCTCG TTTTGACTCG TTAAATAACT ATTACGCAAT TGGAATGAAA 
AACAAAGGCG TTGATTTTAA TAAAGACTTA GATTTAACAA GCAAAGATTC ACAAGAAGTC 
GTGGACTATT ACCGTGATGG TATCGAAGCA GGTTACTTCC GCACAGCTGG TTCAGATAAA 
TATTTATCTG GCCCATTTGC AAACAAAAAG GTAGCAATGT TTGTCGGTAG TATTGCTGGT 
GCTGGTTTTG TTCAAAAAGA TGCTGAAGCT GGTGGCTATG AATACGGTGT TGCACCACGT 
CCTGAAAAAA TCAACTTACA ACAAGGAACA GATATTTATA TGTTCGATAG TGCTACGCCA 
GAACAACGGA CAGCGGCATT TGAATTCATG AAATTCTTAG CTACTCCTGA TTCACAATTG 
TACTGGGCAC AACAAACAGG TTATATGCCA ATTTTAGAAT CTGTTTTACA CAGTGATGAG 
TACAAAAATT CTAAGACAAC CAAAGTACCT GCACAACTTG AAAACGCAGT AAAAGATTTA 
TTCGCTATCC CAGTAGAAGA AAATGCTGAT TCAGCCTATA ATGAAATGCG GACAATTATG 
GAAAGTATTT TTGCTTCATC AAATAAAGAC ACGAGAAAAT TATTGAAAGA TGCAACATCA 
CAATTTGAAC AAGCATGGAA CCAATAA 



EFOOl-2 (SEQ ID N0:2) 



MKFKTLATT VLATAAIFAL GACGNGNGAK ESNDIVKEVK 

EDTTITFWHA MNGVQEEALT KLTKDFMKEN PKIKVELQNQ SAYPDLQAKI NSTLTSPKDL 
PTITQAYPGW LWNAAQDEML VDLKPYMDDD TIGWKDAEPI REVLLDGAKI DGKQYGIPFN 
KSTEMLFYNA DLLKEYGVEV PKTLEELKEA SKTIYEKSNK EWGAGFDSL NNYYAIGMKN 
KGVDFNKDLD LTSKD3QEW DYYRDGIEAG YFRTAGSDKY LSGPFANKKV AMFVGSIAGA 
GFVQKDAEAG GYEYGVAPRP EKINLQQGTD lYMFDSATPE QRTAAFEFMK FLATPDSQLY 
WAQQTGYMPI LESVLHSDEY KNSKTTKVPA QLENAVKDLF ' AIPVEENADS AYNEMRTIME 
SIFASSNKDT RKLLKDATSQ FEQAWNQ 



EFOOl-3 (SEQ ID N0:3) 

TT GTGGTAACGG TAATGGGGCC AAAGAATCAA ACGATATTGT GAAAGAAGTG 
AAGGAAGATA CGACAATCAC TTTCTGGCAT GCAATGAATG GGGTTCAAGA AGAAGCGTTA 
ACAAAATTAA CGAAAGACTT CATGAAAGAA AATCCAAAAA TTAAAGTGGA ATTACAAAAT 
CAATCTGCTT ACCCTGATTT ACAAGCCAAA ATCAATTCGA CTTTAACTTC ACCAAAAGAT 
TTACCAACAA TTACGCAAGC GTACCCAGGC TGGTTATGGA ATGCTGCACA AGATGAAATG 
TTAGTGGACT TAAAACCATA TATGGATGAT GACACAATCG GCTGGAAAGA TGCAGAGCCA 
ATTCGTGAAG TATTGTTAGA CGGCGCCAAA ATCGACGGCA AACAATACGG CATTCCATTT 
AATAAATCGA CAGAAATGTT ATTCTATAAT GCTGATTTGT TGAAAGAATA TGGTGTTGAA 
GTACCGAAAA CATTAGAGGA ATTAAAAGAA GCTTCTAAAA CAATTTACGA AAAATCCAAC 
AAAGAAGTCG TTGGTGCTGG TTTTGACTCG TTAAATAACT ATTACGCAAT TGGAATGAAA 
AACAAAGGCG TTGATTTTAA TAAAGACTTA GATTTAACAA GCAAAGATTC ACAAGAAGTC 
GTGGACTATT ACCGTGATGG TATCGAAGCA GGTTACTTCC GCACAGCTGG TTCAGATAAA 
TATTTATCTG GCCCATTTGC AAACAAAAAG GTAGCAATGT TTGTCGGTAG TATTGCTGGT 
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TABLE 1. Nucleotide and Amino Acid Scqeuences of E.faecalis Genes. 

GCTGGTTTTG TTCAAAAAGA TGCTGAAGCT GGTGGCTATG AATACGGTGT TGCACCAGGT 
CCTGAAAAAA TCAACTTACA ACAAGGAACA GATATTTATA TGTTCGATAG TGCTACGCCA 
GAACAACGGA CAGCGGCATT TGAATTCATG AAATTCTTAG CTACTCCTGA TTCACAATTG 
TACTGGGCAC AACAAACAGG TTATATGCCA ATTTTAGAAT CTGTTTTACA CAGTGATGAG 
TACAAAAATT CTAAGACAAC CAAAGTACCT GCACAACTTG AAAACGCAGT AAAAGATTTA 
TTCGCTATCC CAGTAGAAGA AAATGCTGAT TCAGCCTATA ATGAAATGCG GACAATTATG 
GAAAGTATTT TTGCTTCATC AAATAAAGAC ACGAGAAAAT TATTGAAAGA TGCAACATCA 
CAATTTGAAC AAGCATGGAA CCAA 



EFOOl-4 (SEQ ID NO : 4 ) 

CGNGNGAK ESNDIVKEVK 
EDTTITFWHA MNGVQEEALT KLTKDFjMKEN 
PTITQAYPGW LWNAAQDEML VDLKPYMDDD 
KSTEMLFYNA DLLKEYGVEV PKTLEELKEA 
KGVDFNKDLD LTSKDSQEW DYYRDGIEAG 
GFVQKDAEAG GYEYGVAPRP EKINLQQGTD 
WAQQTGYMPI LESVLHSDEY KNSKTTKVPA 
SIFASSNKDT RKLLKDATSQ FEQAWNQ 



PKIKVELQNQ SAYPDLQAKI NSTLTSPKDL 
TIGWKDAEPI REVLLDGAKI DGKQYGIPFN 
SKTIYEKSNK EWGAGFDSL NNYYAIGMKN 
YFRTAGSDKY LSGPFANKKV AMFVGSIAGA 
lYMFDSATPE QRTAAFEFMK FLATPDSQLY 
QLENAVKDLF AIPVEENADS AYNEMRTIME 



EF002-1 (SEQ ID N0:5) 

TAAATAGCGG AGGTAGTACA AATGAAATTT 
TTAGCAGTGG CGGCAGTAAC TTTAACAGCA 
GAAAAGAGTG AAGATGGCAA AACAAAATTA 
CCAGAATTTG AGAAATTATT CAGAGCTTTT 
CCGGTGGACA TTGCTTCAGA TGATTATGAC 
GATACGACGG ATATTTTAAC CATGAAAAAC 
AATCAATTGG TGGATTTAAC CGATCACGTT 
AGTTACGAGA TGTATGAAAT CGATGGTAAA 
TGGGTATTGT ATTACAATAA AAAAATGTTT 
TTAACTTGGG ATGAATATGA AGCGTTAGCG 
TATGGTGCCT ATCAACATAC TTGGCGCTCA 
AATGCCAATT TGATTGAACC AAAATACAAT 
AGAATGCAAA AAGATCAATC ACAAATGGAT 
TATCAATCAC AATTTGAAAA TTCAAAAGCG 
GGGACTTTAT TAACAAACAT TGATGATGGC 
ATACCACAAC AAGAAAAAGG CAAAGCAACT 
AATAAAAACA GTAAAAAACA AAAAGCTGCT 
GAAGGTGCAA AACTTTTAGC AGAAGTAGGG 
GATAAAATCT ACTTTGCAAG AAAAGGAATG 
ACCCAGATAC AATTAATTTA G 



TGGAAAAAAG GCTTAACAGC GGCAGGGCTG 
TGTGGTGGTT CAAGTGAAAA GAAAGCAACT 
ACAGTAACTA CTTGGAATTA TGACACGACC 
GAAGCGGAAA ATCCTGATAT CACTATTGAA 
ACAAAAGTAA CAACGATGCT TTCATCAOGA 
TTACTTTCAT ATTCTAATTA CGCGCTACGC 
AAAGATTTAG ATATCGAACC TGCCAAAGCA 
ACCTATGCTC AGCCTTACCG TACAGATTTC 
GATGAAGCCG GAATTGCCTA TCCCGATAAC 
AAAAAATTAT CTAAACCAGA AGAACAAGTA 
ACCGTTCAAG CGATTGCTGC TGCTCAAAAC 
TATATGGAAA CTTATTATGA TCGCGCATTG 
TTTGGAACAG CAAAATCAAC AAAAGTAACG 
GCGATGATGT ACATGGGTAG CTGGTACATG 
AAAACAAATG TCGAATGGGG GATTGCCGAA 
ACCTTTGGCT CACCGACAAG TTTTGCAATT 
CAAAAATTCT TAGACTTTGC TTCAGGTAAA 
GTGGTTCCTT CTTATAAAAC AGATGAAATT 
CCTTCAGACG AGTCTCACAA AAAGCCTTTA 



EF002-2 (SEQ ID NO: 6) 

MKFW KKGLTAAALL AVAAVTLTAC GGSSEKKATE KSEDGKTKLT VTTWNYDTTP 
EFEKLFRAFE AENPDITIEP VDIASDDYDT KVTTMLSSGD TTDILTMKNL LSYSNYALRN 
QLVDLTDHVK DLDIEPAKAS YEMYEIDGKT YAQPYRTDFW VLYYNKKMFD EAGIAYPDNL 
TWDEYEALAK KLSKPEEQVY GAYQHTWRST VQAIAAAQNN ANLIEPKYNY METYYDRALR 
MQKDQSQMDF GTAKSTKVTY QSQFENSKAA MMYMGSWYMG TLLTNIDDGK TNVEWGIAEI 
PQQEKGKATT FGSPTSFAIN KNSKKQKAAQ KFLDFASGKE GAKLLAEVGV VPSYKTDEID 
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KIYFARKGMP SDESHKKPLT QIQLI 
EF002-3 (SEQ ID N0:7) 



A TGTGGTGGTT CAAGTGAAAA GAAAGCAAC 
GAAAAGAGTG AAGATGGCAA PJ^QhPJiKVTK 
CCAGAATTTG AGAAATTATT CAGAGCTTTT 
CCGGTGGACA TTGCTTCAGA TGATTATGAC 
GATACGACGG ATATTTTAAC CATGAAAAAC 
AATCAATTGG TGGATTTAAC CGATCACGTT 
AGTTACGAGA TGTATGAAAT CGATGGTAAA 
TGGGTATTGT ATTACAATAA AAAAATGTTT 
TTAACTTGGG ATGAATATGA AGCGTTAGCG 
TATGGTGCCT ATCAACATAC TTGGCGCTCA 
AATGCCAATT TGATTGAACC AAAATACAAT 
AGAATGCAAA AAGATCAATC ACAAATGGAT 
TATCAATCAC AATTTGAAAA TTCAAAAGCG 
GGGACTTTAT TAACAAACAT TGATGATGGC 
ATACCACAAC AAGAAAAAGG CAAAGCAACT 
AATAAAAACA GTAAAAAACA AAAAGCTGCT 
GAAGGTGCAA AACTTTTAGC AGAAGTAGGG 
GATAAAATCT ACTTTGCAAG AAAAGGAATG 
ACCCAGATAC AATTAATT 



ACAGTAACTA CTTGGAATTA TGACACGACC 
GAAGCGGAAA ATCCTGATAT CACTATTGAA 
ACAAAAGTAA CAACGATGCT TTCATCAGGA 
TTACTTTCAT ATTCTAATTA CGCGCTACGC 
AAAGATTTAG ATATCGAACC TGCCAAAGCA 
ACCTATGCTC AGCCTTACCG TACAGATTTC 
GATGAAGCCG GAATTGCCTA TCCCGATAAC 
AAAAAATTAT CTAAACCAGA AGAACAAGTA 
ACCGTTCAAG CGATTGCTGC TGCTCAAAAC 
TATATGGAAA CTTATTATGA TCGCGCATTG 
TTTGGAACAG CAAAATCAAC AAAAGTAACG 
GCGATGATGT ACATGGGTAG CTGGTACATG 
AAAACAAATG TCGAATGGGG GATTGCCGAA 
ACCTTTGGCT CACCGACAAG TTTTGCAATT 
CAAAAATTCT TAGACTTTGC TTCAGGTAAA 
GTGGTTCCTT CTTATAAAAC AGATGAAATT 
CCTTCAGACG AGTCTCACAA AAAGCCTTTA 



EF002-4 (SEQ ID NO: 8) 



C GGSSEKKATE KSEDGKTKLT VTTWNYDTTP 

EFEKLFRAFE AENPDITIEP VDIASDDYDT KVTTMLSSGD TTDILIMKNL LSYSNYALRN 
QLVDLTDHVK DLDIEPAKAS YEMYEIDGKT YAQPYRTDFW VLYYNKKMFD EAGIAYPDNL 
TWDEYEALAK KLSKPEEQVY GAYQHTWRST VQAIAAAQNN ANLIEPKYNY METYYDRALR 
MQKDQSQMDF GTAKSTKVTY QSQFENSKAA MMYMGSWYMG TLLTNIDDGK TNVEWGIAEI 
PQQEKGKATT FGSPTSFAIN KNSKKQKAAQ KFLDFASGKE GAKLLAEVGV VPSYKTDEID 
KIYFARKGMP SDESHKKPLT QIQLI 



EF003-1 (SEQ ID N0:9) 

TAGGAGGACA AAAGAATGAA 
ATTTTAGCTG CCTGTGGGGG 
GTTGCCGTGC AATTGGAATC 
AAAAAAGGGT ACAAAATTAA 
GTGCAACATG ACGAAGCGGA 
AACAAAGAGA AAAAAGCTGA 
TTCTATTCAA AAGAATACCA 
CCTAGCGATC CAACCAATGA 
AAATTAAAAG AAGGTGTCGG 
AACATCACTT TTGAAAGCAT 
ATCGCTATGG TGTTCTGCTA 
GCGATCTTGT TAGAAGATAA 
AAAGGCGAAA AAGATAGCGA 
GTTGCTGAAT ACATCAAGAA 



GAAGTTTTAT TTAGCNACAT 
AAATAAACAA GCAGACCAGA 
TTCAAAAGAT ATCTTGGAGA 
CATTATGGAA GTGAGCGACA 
TGCTAATTTT GCGCAACATC 
TTTAGTGGCT GTGCAACCGA 
AGATGCGAAA GATTTACCTG 
AGGTCGTGCT TTAGCAATTT 
CTTTAACGGC ACGGTGGCAG 
TGATTTACTG AATTTAGCTA 
CCCAGCCTAC TTAGAACCTG 
AGAAGCAAGT AAACATTACG 
AAAAATCAAG GTTTTAAAAG 
AAATTCTAAA GGCGCCAATA 



TGGCTGTTAT TGCAACAGTT 
AAGAAGACAA GGAGATTACC 
TTGCCAAGAA AGAAGCTGAG 
ATGTTGCCTA CAACGATGCC 
AACCCTTCAT GGAAATGTTT 
TTTATTATTT TGCTGGTGGT 
AAAATGCCAA AGTGGGGATT 
TAAATGCAAA CGGCGTGATT 
AT6TCGTGGA AAATCCTAAA 
AAGCCTATGA TGAAAAAGAC 
CTGGTTTAAC AACGAAAGAT 
CATTGCAAGT TGTGACACGC 
AAGCGATGAC AACAAAAGAA 
TTCCTGCGTT TTAA 



EF003-2 (SEQ ID NO:10) 
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MKKFYL ATFAVIATVI LAACGGNKQA DQKEDKEITV AVQLESSKDI LEIAKKEAEK 
KGYKINIMEV SDNVAYNDAV QHDEADANFA QHQPFMEMFN KEKKADLVAV QPIYYFAGGF 
YSKEYQDAKD LPENAKVGIP SDPTNEGRAL AILNANGVIK LKEGVGFNGT VADWENPKN 
ITFESIDLLN LAKAYDEKDI AMVFCYPAYL EPAGLTTKDA ILLEDKEASK HYALQWTRK 
GEKDSEKIKV LKEAMTTKEV AEYIKKNSKG ANIPAF 

EF003-3 (SEQ ID N0:11) 

CTGTGGGGG AAATAAACAA GCAGACCAGA AAGAAGACAA GGAGATTACC 
GTTGCCGTGC AATTGGAATC TTCAAAAGAT ATCTTGGAGA TTGCCAAGAA AGAAGCTGAG 
AAAAAAGGGT ACAAAATTAA CATTATGGAA GTGAGCGACA ATGTTGCCTA CAACGATGCC 
GTGCAACATG ACGAAGCGGA TGCTAATTTT GCGCAACATC AACCCTTCAT GGAAATGTTT 
AACAAAGAGA AAAAAGCTGA TTTAGTGGCT GTGCAACCGA TTTATTATTT TGCTGGTGGT 
TTCTATTCAA AAGAATACCA AGATGCGAAA GATTTACCTG AAAATGCCAA AGTGGGGATT 
CCTAGCGATC CAACCAATGA AGGTCGTGCT TTAGCAATTT TAAATGCAAA CGGCGTGATT 
AAATTAAAAG AAGGTGTCGG CTTTAACGGC ACGGTGGCAG ATGTCGTGGA AAATCCTAAA 
AACATCACTT TTGAAAGCAT TGATTTACTG AATTTAGCTA AAGCCTATGA TGAAAAAGAC 
ATCGCTATGG TGTTCTGCTA CCCAGCCTAC TTAGAACCTG CTGGTTTAAC AACGAAAGAT 
GCGATCTTGT TAGAAGATAA AGAAGCAAGT AAACATTACG CATTGCAAGT TGTGACACGC 
AAAGGCGAAA AAGATAGCGA AAAAATCAAG GTTTTAAAAG AAGCGATGAC AACAAAAGAA 
GTTGCTGAAT ACATCAAGAA AAATTCTAAA GGCGCCAATA TTCCTGCGTT T 



EF003-4 {SEQ ID NO: 12) 

CGGNKQA DQKEDKEITV AVQLESSKDI LEIAKKEAEK 

KGYKINIMEV SDNVAYNDAV QHDEADANFA QHQPFMEMFN KEKKADLVAV QPIYYFAGGF 
YSKEYQDAKD LPENAKVGIP SDPTNEGRAL AILNANGVIK LKEGVGFNGT VADWENPKN 
ITFESIDLLN LAKAYDEKDI AMVFCYPAYL EPAGLTTKDA ILLEDKEASK HYALQWTRK 
GEKDSEKIKV LKEAMTTKEV AEYIKKNSKG ANIPAF 



EF004-1 (SEQ ID N0:13) 

TAAATCGAAA GAAGGATGAT AGAAATGAAA AAAATGATTA AATTTGCAGG CATTGCTCTT 
ATTTTTGCAG CTCTTCTCTC TGCCTGTAGC AACGCAAAAA ATAATACACA AAAGAAAGCC 
GAAACTGCTG CCCAGTCAAG CACTATTGAA GCTTCAGACA GTAACGAAAA CGAGCCTAAT 
ACAGAAAACA TAACCCAAGC AGTTAAACAG TTAGAAGAAA AATTTAACTC TGACGAGAAA 
TTAGTAAAAA TAGATGTTAA AAATAATGTT AAAGATGACA CATCAGATAA CCCTCACGCT 
GTCATTACGG TTAAGGTAAT TAATGATGAA GCAAAAAAAA ATATGGAAGA AATGCAGACT 
GCGATAGATT CCAACTCAGG TACAGAGGCA CAAAAGACTG CCATATACGG AATTCAATTA 
AATGTTGAAG AAGTAGCCAA AACATTAGAA AATGATAACG ATGTTATTTC TTTCATCACA 
CCTTACACGA ATGGGAACGA CAGAACCATA GCAAAATCAA CTAAAAATGA AAATATTATT 
CCGTTAGTAA AATAA 

EF004-2 (SEQ ID NO: 14) 

MKK MIKFAGIALI FAALLSACSN AKNNTQKKAE TAAQSSTIEA SDSNENEPNT 
ENITQAVKQL EEKFNSDEKL VKIDVKNNVK DDTSDNPHAV ITVKVINDEA KKNMEEMQTA 
IDSNSGTEAQ KTAIYGIQLN VEEVAKTLEN DNDVISFITP YTNGNDRTIA KSTKNENIIP 
LVK 



EF004-3 (SEQ ID NO: 15) 
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CTGTAGC AACGCAAAAA ATAATACACA AAAGAAAGCC 

GAAACTGCTG CCCAGTCAAG CACTATTGAA GCTax:AGACA GTAACGAAAA CGAGCCTAAT 
ACAGAAAACA TAACCCAAGC AGTTAAACAG TTAGAAGAAA AATTTAACTC TGACGAGAAA 
TTAGTAAAAA TAGATGTTAA AAATAATGTT AAAGATGACA CATCAGATAA CCCTCACGCT 
GTCATTACGG TTAAGGTAAT TAATGATGAA GCAAAAAAAA ATATGGAAGA AATGCAGACT 
GCGATAGATT CCAACTCAGG TACAGAGGCA CAAAAGACTG CCATATACGG AATTCAATTA 
AATGTTGAAG AAGTAGCCAA AACATTAGAA AATGATAACG ATGTTATTTC TTTCATCACA 
CCTTACACGA ATGGGAACGA CAGAACCATA GCAAAATCAA CTAAAAATGA AAATATTATT 
CCGTTAGTAA AA 



EF004-4 (SEQ ID NO: 16) 

CSN AKNNTQKKAE TAAQSSTIEA SDSNENEPNT 

ENITQAVKQL EEKFNSDEKL VKIDVKNNVK DDTSDNPHAV ITVKVINDEA KKNMEEMQTA 
IDSNSGTEAQ KTAIYGIQLN VEEVAKTLEN DNDVISFITP YTNGNDRTIA KSTKNENIIP 
LVK 



EF005-1 (SEQ ID NO: 17) 

TAAAAAATGA AAAAACGATT GACGATTGTG GGGATGCTTT TTCTGGCCAT TTTAGTAATG 
GTTGGTTGTG GTAAAAATCA GCAAGCAACG ACAAAAGAAA AAGAGACAAA ACCTGAAGAA 
CTAACTCTTT ACATTGTGCG CCACGGAAAA ACCATGTTAA ATACGAGGGA CCGCGTACAA 
GGATGGTCAG ATGCGGTCCT AACACCAGAA GGTGAAAAAG TTGTGACAGC AACTGGGATT 
GGACTGAAAG ATGTTGCCTT TCAAAATGCA TATAGTAGTG ATAGTGGCCG CGCCTTGCAA 
ACTGCTCAAC TTATTTTAGA TCAAAATAAA GCAGGCAAAG ACCTTGAAGT CGTGCGTGAC 
CCAGATTTAC GTGAATTTAA TTTTGGTAGC TATGAAGGGG ATTTAAATAA GACAATGTGG 
CAGGATATTG CTGATGATCA AGGTGTTTCC TTAGAAGAAT TTATGAAAAA CATGACTCCT 
GAATCCTTTG CCAATAGTGT AGCTAAACTG GATCAACAGC GCGAGGAAAG CAAGAATAAC 
TGGCCTGCAG AAGACTATGC TACAATTACT AAACGTTTGA AAAAAGGCTT AGATAAAATT 
GTTGCCACAG AATCAGCCAA TTCTGGGAAT GGCAATGTTT TAGTGGTCTC TCATGGCTTG 
AGTATTTCAG CGTTGTTAGC AACTTTATTT GATGATTTTA AAGTCCCAGA AGGCGGTTTG 
AAGAATGCTA GTGTCACAAC AATTCATTAC AAAAATGGCG AATATACTTT GGATAAAGTC 
AATGATCTCA GCTACTTAGA AGCAGGCGAA AAAGAATCAA AATAA 

EF005-2 (SEQ ID NO: 18) 

MKKRLTIVG MLFLAILVMV GCGKNQQATT KEKETKPEEL TLYIVRHGKT MLNTTDRVQG 
WSDAVLTPEG EKWTATGIG LKDVAFQNAY SSDSGRALQT AQLILDQNKA GKDLEWRDP 
DLREFNFGSY EGDLNKTMWQ DIADDQGVSL EEFMKNMTPE SFANSVAKLD QQREESKNNW 
PAEDYATITK RLKKGLDKXV ATESANSGNG NVLWSHGLS ISALLATLFD DFKVPEGGLK 
NASVTTIHYK NGEYTLDKVN DVSYLEAGEK ESK 

EF005-3 (SEQ ID NO: 19) 

TTGTG GTAAAAATCA GCAAGCAACG ACAAAAGAAA AAGAGACAAA ACCTGAAGAA 
CTAACTCTTT ACATTGTGCG CCACGGAAAA ACCATGTTAA ATACGACGGA CCGCGTACAA 
GGATGGTCAG ATGCGGTCCT AACACCAGAA GGTGAAAAAG TTGTGACAGC AACTGGGATT 
GGACTGAAAG ATGTTGCCTT TCAAAATGCA TATAGTAGTG ATAGTGGCCG CGCCTTGCAA 
ACTGCTCAAC TTATTTTAGA TCAAAATAAA GCAGGCAAAG ACCTTGAAGT CGTGCGTGAC 
CCAGATTTAC GTGAATTTAA TTTTGGTAGC TATGAAGGGG ATTTAAATAA GACAATGTGG 
CAGGATATTG CTGATGATCA AGGTGTTTCC TTAGAAGAAT TTATGAAAAA CATGACTCCT 
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GAATCCTTTG CCAATAGTGT AGCTAAACTG GATCAACAGC GCGAGGAAAG CAAGAATAAC 
TGGCCTGCAG AAGACTATGC TACAATTACT AAACGTTTGA AAAAAGGCTT AGATAAAATT 
GTTGCCACAG AATCAGCCAA TTCTGGGAAT GGCAATGTTT TAGTGGTCTC TCATGGCTTG 
AGTATTTCAG CGTTGTTAGC AACTTTATTT GATGATTTTA AACTCCCAGA AGGCGGTTTG 
AAGAATGCTA GTGTCACAAC AATTCATTAC AAAAATGGCG AATATACTTT GGATAAAGTC 
AATGATGTCA GCTACTTAGA AGCAGGCGAA AAAGAATCAA AA 

EF005-4 (SEQ ID NO:20) 

CGKNQQATT KEKETKPEEL TLYIVRHGKT MLNTTDRVQG 

WSDAVLTPEG EKWTATGIG LKDVAFQNAY SSDSGRALQT AQLILDQNKA GKDLEWRDP 
DLREFNFGSY EGDLNKTMWQ DIADDQGVSL EEFMKNMTPE SFANSVAKLD QQREESKNNW 
PAEDYATITK RLKKGLDKIV ATESANSGNG NVLWSHGLS ISALLATLFD DFKVPEGGLK 
NASVTTIHYK NGEYTLDKVN DVSYLEAGEK ESK 



EF006-1 (SEQ ID N0:21) 

TAAACGATAA ATGGAGGGAA TAAGATGAAA AAACGTACAT TATGGTCAGT AATTACTGTA 
GCAGTAGCTG TCTTAGTTTT AGGGGCTTGC GGCAATAAAA AGAGTGATGA CTCGGTCTTG 
AAAGTTGGAG CTTCACCAGT TCCACATGCA GAGATTTTAG AACATGTAAA ACCTTTATTA 
GAAAAAGAAG GCGTAAAATT AGAAGTGACG ACTTATACAG ATTACGTGCT ACCTAACAAG 
GCGTTGGAAA GTGGCGATAT CGATGCCAAC TATTTCCAAC ATGTGCGGTT CTTTAATGAA 
GCGGTTAAAG AAAATGATTA TGACTTTGTG AATGCAGGTG CGATTCATTT AGAACCAGTT 
GGGCTTTACT CGAAAAAATA CAAATCGTTA CAAGAAATTC CTGATGGTTC AACGATTTAC 
GTTAGCTCTT CCGTTTCAGA TTGGCCACGC GTATTAACTA TCTTAGAAGA TGCTGGTTTA 
ATCACGCTGA AAGAAGGGGT AGACCGGACA ACTGCTACTT TCGATGATAT TGATAAAAAT 
ACTAAAAAGT TGAAATTCAA TCATGAAAGT GATCCAGCAA TCATGACCAC TCTTTATGAC 
AATGAAGAAG GGGCTGCGGT TTTAATTAAC TCAAACTTTG CCGTGGATCA AGGATTAAAT 
CCGAAAAAAG ATGCGATTGC CTTAGAAAAA GAAAGTTCAC CTTATGCCAA TATTATTGCG 
GTTCGTAAAG AAGACGAAAA CAACGAAAAT GTAAAAAAAT TAGTCAAAGT GTTACGTAGC 
AAAGAAGTCC AAGATTGGAT TACGAAAAAA TGGAACGGCG CTATTGTTCC AGTCAATGAA 



EF006-2 (SEQ ID N0:22) 

MKK RTLWSVITVA VAVLVLGACG NKKSDDSVLK VGASPVPHAE ILEHVKPLLE 
KEGVKLEVTT YTDYVLPNKA LESGDIDANY FQHVPFFNEA VKENDYDFVN AGAIHLEPVG 
LYSKKYKSLQ EIPDGSTIYV SSSVSDWPRV LTILEDAGLI TLKEGVDRTT ATFDDIDKNT 
KKLKFNHESD PAIMTTLYDN EEGAAVLINS NFAVDQGLNP KKDAIALEKE SSPYANIIAV 
RKEDENNENV KKLVKVLRSK EVQDWITKKW NGAIVPVNE 

EF006-3 (SEQ ID NO:23) 

TTGC GGCAATAAAA AGAGTGATGA CTCGGTCTTG 

AAAGTTGGAG CTTCACCAGT TCCACATGCA GAGATTTTAG AACATGTAAA ACCTTTATTA 
GAAAAAGAAG GCGTAAAATT AGAAGTGACG ACTTATACAG ATTACGTGCT ACCTAACAAG 
GCGTTGGAAA GTGGCGATAT CGATGCCAAC TATTTCCAAC ATGTGCCGTT CTTTAATGAA 
GCGGTTAAAG AAAATGATTA TGACTTTGTG AATGCAGGTG CGATTCATTT AGAACCAGTT 
GGGCTTTACT CGAAAAAATA CAAATCGTTA CAAGAAATTC CTGATGGTTC AACGATTTAC 
GTTAGCTCTT CCGTTTCAGA TTGGCCACGC GTATTAACTA TCTTAGAAGA TGCTGGTTTA 
ATCACGCTCA AAGAAGGGGT AGACCGGACA ACTGCTACTT TCGATGATAT TGATAAAAAT 
ACTAAAAAGT TGAAATTCAA TCATGAAAGT GATCCAGCAA TCATGACCAC TCTTTATGAC 
AATGAAGAAG GGGCTGCGGT TTTAATTAAC TCAAACTTTG CCGTGGATCA AGGATTAAAT 
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CCGAAAAAAG ATGCGATTGC CTTAGAAAAA GAAAGTTCAC CTTATGCCAA TATTATTGCG 
GTTCGTAAAG AAGACGAAAA CAACGAAAAT GTAAAAAAAT TAGTCAAAGT GTTACGTAGC 
AAAGAAGTCC AAGATTGGAT TACGAAAAAA TGGAACGGCG CTATTGTTCC AGTCAATGAA 



EF006-4 (SEQ ID NO:24) 

CG NKKSDDSVLK VGASPVPHAE ILEHVKPLLE 

KEGVKLEVTT YTDYVLPNKA LESGDIDANY FQHVPFFNEA VKENDYDFVN AGAIHLEPVG 
LYSKKYKSLQ EIPDGSTIYV SSSVSDWPRV LTILEDAGLI TLKEGVDRTT ATFDDIDKNT 
KKLKFNHESD PAIMTTLYDN EEGAAVLINS NFAVDQGLNP KKDAIALEKE SSPYANIIAV 
RKEDENNENV KKLVKVLRSK EVQDWITKKW NGAIVPVNE 

EF008-1 {SEQ ID NO:25) 

TAAACCGTGA GAAAGAAATG GAGGAATCAA CGAATGAAAA AATTTAGTTT ATTTTTTTTA 
ACACTTTTAG CAGGGTTAAC GTTAGCTGCT TGCGGGAATC AAGCCGCTGA AAAGAAAGAA 
AAATTAGCAA TTGTGACAAC GAACTCGATC CTATCTGATT TAGTGAAAAA TGTTGGGCAA 
GACAAAATTG AGCTGCATAG TATTGTGCCA ATTGGGACAG ACCCTCACGA ATATGAACCG 
TTACCAGAAG ACATTGCGAA AGCTTCTGAA GCGGACATTT TATTCTTTAA CGGCTTGAAC 
TTAGAAACAG GCGGAAATGG CTGGTTTAAC AAATTAATGA AAACGGCCAA AAAAGTTGAG 
AATAAAGATT ACTTTTCTAC AAGCAAAAAT GTTACGCCAC AATATTTAAC AAGTGCCGGT 
CAAGAACAAA CAGAAGATCC ACATGCTTGG TTAGACATTG AAAATGGCAT TAAATATGTA 
GAAAACATTC GTGACGTGTT AGTAGAAAAA GATCCAAAAA ATAAAGATTT CTATACAGAA 
AACGCGAAAA ATTATACCGA AAAACTTAGC AAACTACATG AGGAAGCCAA AGCTAAATTT 
GCTGATATTC CTGATGATAA AAAATTATTA GTTACAAGTG AAGGTGCCTT TAAATATTTC 
TCCAAAGCTT ATGATTTAAA TGCCGCTTAT ATTTGGGAAA TTAACACAGA AAGTCAAGGN 
ACACCTGAAC AAATGACCAC GATTATTGAT ACCATTAAGA AATCAAAAGC ACCTGTGTTA 
TTTGTTGAAA CCAGTGTCGA TAAACGTAGT ATGGAACGGG TCTCAAAAGA AGTGAAACGA 
CCAATTTACG ATACACTTTT CACAGACTCT CTTGCCAAAG AAGGAACAGA AGGCGATACG 
TACTACAGCA TGATGAACTG GAATTTAACA AAAATCCATG ATGGCTTAAT GAGTAAATAA 



EF008-2 (SEQ ID NO: 26) 

MKKFSLFFLT LLAGLTLAAC GNQAAEKKEK LAIVTTNSIL SDLVKNVGQD 
KIELHSIVPI GTDPHEYEPL PEDIAKASEA DILFFNGLNL ETGGNGWFNK LMKTAKKVEN 
KDYFSTSKNV TPQYLTSAGQ EQTEDPHAWL DIENGIKYVE NIRDVLVEKD PKNKDFYTEN 
AKNYTEKLSK LHEEAKAKFA DIPDDKKLLV TSEGAFKYFS KAYDLNAAYI WEINTESQGT 
PEQMTTIIDT IKKSKAPVLF VETSVDKRSM ERVSKEVKRP lYDTLFTDSL AKEGTEGDTY 
YSMMNWNLTK IHDGLMSK 

EF008-3 (SEQ ID NO:27) 

T TGCGGGAATC AAGCCGCTGA AAAGAAAGAA 

AAATTAGCAA TTGTGACAAC GAACTCGATC CTATCTGATT TAGTGAAAAA TGTTGGGCAA 
GACAAAATTG AGCTGCATAG TATTGTGCCA ATTGGGACAG ACCCTCACGA ATATGAACCG 
TTACCAGAAG ACATTGCGAA AGCTTCTGAA GCGGACATTT TATTCTTTAA CGGCTTGAAC 
TTAGAAACAG GCGGAAATGG CTGGTTTAAC AAATTAATGA AAACGGCCAA AAAAGTTGAG 
AATAAAGATT ACTTTTCTAC AAGCAAAAAT GTTACGCCAC AATATTTAAC AAGTGCCGGT 
CAAGAACAAA CAGAAGATCC ACATGCTTGG TTAGACATTG AAAATGGCAT TAAATATGTA 
GAAAACATTC GTGACGTGTT AGTAGAAAAA GATCCAAAAA ATAAAGATTT CTATACAGAA 
AACGCGAAAA ATTATACCGA AAAACTTAGC AAACTACATG AGGAAGCCAA AGCTAAATTT 
GCTGATATTC CTGATGATAA AAAATTATTA GTTACAAGTG AAGGTGCCTT TAAATATTTC 
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TCCAAAGCTT ATGATTTAAA 
ACACCTGAAC AAATGACCAC 
TTTGTTGAAA CCAGTGTCGA 
CCAATTTACG ATACACTTTT 
TACTACAGCA TGATGAACTG 



TGCCGCTTAT ATTTGGGAAA 
GATTATTGAT ACCATTAAGA 
TAAACGTAGT ATGGAACGGG 
CACAGACTCT CTTGCCAAAG 
GAATTTAACA AAAATCCATG 



TTAACACAGA AAGTCAAGGN 
AATCAAAAGC ACCTGTGTTA 
TCTCAAAAGA AGTGAAACGA 
AAGGAACAGA AGGCGATACG 
ATGGCTTAAT GAGTAAA 



EF008-4 (SEQ ID NO:28) 

C GNQAAEKKEK LAIVTTNSIL SDLVKNVGQD 

KIELHSIVPI GTDPHEYEPL PEDIAKASEA DILFFNGLNL ETGGNGWFNK LMKTAKKVEN 
KDYFSTSKNV TPQYLTSAGQ EQTEDPHAWL DIENGIKYVE NIRDVLVEKD PKNKDFYTEN 
AKNYTEKLSK LHEEAKAKFA DIPDDKKLLV TSEGAFKYFS KAYDLNAAYI WEINTESQGT 
PEQMTTIIDT IKKSKAPVLF VETSVDKRSM ERVSKEVKRP lYDTLFTDSL AKEGTEGDTY 
YSMMNWNLTK IHDGLMSK 



EF009-1 (SEQ ID NO:29) 



TGACAAATGA AAAAATTTAG TAAATTAATT 
GCAGGTTGTG CATCGGGGTC TGTGAAGGAT 
GTAGGAACAA AAAATGATGA ATGGGAATCG 
GATTTACAAT TGGTAGAATT TACAGACTAT 
GAAATTGATT TAAATGCCTT TCAGCATCAA 
GGAACGAAAT TAGTATCAAT TGGCAATACA 
AAATTGAAAG ATATCACGAA AATTAAAGAC 
ACGAATGGCG GGCGGGCGTT AATTTTATTA 
GCGAAACAGC AACTACCGAC TGTCAGTGAT 
ACTGAATTAG ATGCTACGCA AACAGCGCGC 
AATAGCGGCA TGGCTGTCGA TGCTGGGTAT 
CCTGTAAACG AAAAAGCGAA ACCTTATGTG 
GAGAATAAAC TTTATCAAAA AGTTGTAGAA 
ATTGCAGAAA CATCAAAAGG CGCCAATGTT 

EF009-2 {SEQ ID NO:30) 



GGACTTATTG GGGTATTAGC TTTTACGATT 
ACTAAGACAG, AAACCGTTAA ACTAGGGGTT 
GTCAAAGACC GTTTGAAAAA GAAAAATATT 
ACGCAACCAA ACGCAGCATT AGCAGAAAAA 
ATCTTTTTAG ACAATTACAA TAAAGAGCAT 
GTCAATGCAC CATTGGGAAT TTACOCTAAT 
GGCGGAGAAA TTGCTATTCC TAATGACCCA 
CAAACTGCAG GACTGATAAA AGTAGATCCT 
ATTACTGAAA ATAAACGCCA ATTGAA/^TA 
GCTTTACAAG ATGTCGATGC TTCAGTGATT 
ACACCAGATA AAGATGCTAT TTTCTTAGAA 
AACATTGTCG TGGCCCGAGA AGAAGATCAA 
GAATATCAAC AAGAAGAAAC GAAAAAGGTC 
CCAGCCTGGG AAACATTTGG TAAAAAATAA 



MKKFSKLIG LIGVLAFTIA GCASGSVKDT KTETVKLGW GTKNDEWESV KDRLKKKNID 
LQLVEFTDYT QPNAALAEKE IDLNAFQHQI FLDNYNKEHG TKLVSIGNTV NAPLGIYANK 
LKDITKIKDG GEIAIPNDPT NGGRALILLQ TAGLIKVDPA KQQLPTVSDI TENKRQLKIT 
ELDATQTARA LQDVDASVIN SGMAVDAGYT PDKDAIFLEP VNEKAKPYVN IWAREEDQE 
NKLYQKWEE YQQEETKKVI AETSKGANVP AWETFGKK 

EF009-3 (SEQ ID N0:31) 

TTGTG CATCGGGGTC TGTGAAGGAT ACTAAGACAG AAACCGTTAA ACTAGGGGTT 
GTAGGAACAA AAAATGATGA ATGGGAATCG GTCAAAGACC GTTTGAAAAA GAAAAATATT 
GATTTACAAT TGGTAGAATT TACAGACTAT ACGCAACCAA ACGCAGCATT AGCAGAAAAA 
GAAATTGATT TAAATGCCTT TCAGCATCAA ATCTTTTTAG ACAATTACAA TAAAGAGCAT 
GGAACGAAAT TAGTATCAAT TGGCAATACA GTCAATGCAC CATTGGGAAT TTACGCTAAT 
AAATTGAAAG ATATCACGAA AATTAAAGAC GGCGGAGAAA TTGCTATTCC TAATGACCCA 
ACGAATGGCG GGCGGGCGTT AATTTTATTA CAAACTGCAG GACTGATAAA AGTAGATCCT 
GCGAAACAGC AACTACCGAC TGTCAGTGAT ATTACTGAAA ATAAACGCCA ATTGAAAATA 
ACTGAATTAG ATGCTACGCA AACAGCGCGC GCTTTACAAG ATGTCGATGC TTCAGTGATT 
AATAGCGGCA TGGCTGTCGA TGCTGGGTAT ACACCAGATA AAGATGCTAT TTTCTTAGAA 
CCTGTAAACG AAAAAGCGAA ACCTTATGTG AACATTGTCG TGGCCCGAGA AGAAGATCAA 
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GAGAATAAAC TTTATCAAAA AGTTGTAGAA GAATATCAAC AAGAAGAAAC GAAAAAGGTC 
ATTGCAGAAA CATCAAAAGG CGCCAATGTT CCAGCCTGGG AAACATTTGG TAAAAAA 



EF009-4 (SEQ ID NO:32) 

CASGSVKDT KTETVKLGW GTKNDEWESV KDRLKKKNID 

LQLVEFTDYT QPNAALAEKE IDLNAFQHQI FLDNYNKEHG TKLVSIGNTV NAPLGIYANK 
LKDITKIKDG GEIAIPNDPT NGGRALILLQ TAGLIKVDPA KQQLPTVSDI TENKRQLKIT 
ELDATQTARA LQDVDASVIN SGMAVDAGYT PDKDAIFLEP VNEKAKPYVN IWAREEDQE 
NKLYQKWEE YQQEETKKVI AETSKGANVP AWETFGKK 



EFOlO-1 (SEQ ID NO:33) 

TGAAAGAATA AAATTGTACA GGAGGAAATA AGGAATGAAA AAATGGCAAA AAGGATTAGC 
CGTAGCTGGC GCACAGCTTT AGCTGTAGGA CTAAGCGCGT GCGGTAAATC TTCAAAAGAT 
GCAGCGTCAA AAGGTGATGA TAGTACACCA ACGTTATTAA TGTATCGTGT TGGGGACAAA 
CCAGATAATT ATGACCAATT AATCGATAAT GCGAATAAAA TTATCGAGAA AAAAATTGGG 
GCAAAATTAA AAATGGAATT TGTTGGTTGG GGCGATTGGG ACCAAAAAAT GTCAACAATC 
GTTGCTTCTG GTGAAAGCTA TGATATTTCA TTAGCACAAA ATTATGCAAC GAATGCACAA 
AAAGGCGCCT ATGCTGATTT AACTGATTTA GCACCTAAAT ATGCCAAAGA AGCCTATGAT 
CAATTGCCAG ATAACTATAT TAAAGGAAAT ACGATTAATG GAAAACTGTA TGCGTTCCCA 
ATTTTAGGTA ACTCTTACGG TCAACAAGTT TTAACTTTTA ATAAAGAATA TGTCGATAAA 
TACAATTTAG ATATTAGTAA AGTCGATGGT AGTTATGAAA GTGCAACGGA AGTTCTAAAA 
GAATTCCNTA AAAANGANCC AAATATTGCT GCTTTTGCTA TCGGCCAAAC ATTCTTTGCA 
ACAGGTAATT ATGACTTCCC TATTGGTAAC CAATATCCAT TTGCAGTAAA AACAACTGAT 
ACTGGCTCAC CAAAAATTAT TAACCAATAT GCCGACAAAG ACATGATTAA TAACTTAAAA 
GTCTTGCATC AATGGTATAA AGATGGCTTG ATTCCAACAG ATGCTGCTAC AAGTACAACA 
CCATATGACT TAAATACCAA TACTTGGTTT ATGCGTCAAG AAACACAAGG ACCTATGGAT 
TATGGTGATA CAATCTTAAC ACAAGCTGCT GGCAAACCAC TTGTTTCTCG TCCACTAACA 
GAACCATTAA AAACAACAGC TCAAGCGCAA ATGGCTAACT ATGTTGTTGC AAACACGTCT 
AAAAACAAAG AAAAATCTGT TGAATTGTTA GGTTTATTAA ACAGCAATCC AGAATOXSTTA 
AACGGACTTG TTTATGGTGA AGAAGGCAAA CAATATGAAA AAGTTGGCGA TGATCGTGTG 
AAATTGTTGA AAGATTACAC ACCAACAACT CATTTGAGTG CTTGGAACAC AGGAAACAAC 
TTAATCATTT GGCCAGAAGA ATCTGTCACT GAAGAAATGG TTAAAGAACG TGATAAGAGC 
ATCGAAGAAG CAAAAGATTC ACCAATTCTT GGTTTTACTT TTGTAAATGA TAAAGTGAAA 
ACTGAAATCA CTAACGTTGC TACAGTTATG AACCGTTACG CAGCAAGCTT AAATACAGGA 
ACTGTTGATC CAGAAGAAAC ACTTCCAAAA TTAATGGATG ACCTAAAAAC AGCTGGCTGG 
GATAAAGTTC AAAAAGAAAT GCAAACACAA TTAGACGAAT ATATCCAATC TCAAAAATAA 

EFOlO-2 (SEQ ID NO:34) 

MAKRISR SWRTALAVGL SACGKSSKDA ASKGDDSTPT LLMYRVGDKP 
DNYDQLIDNA NKIIEKKIGA KLKMEFVGWG DWDQKMSTIV ASGESYDISL ' AQNYATNAQK 
GAYADLTDLA PKYAKEAYDQ LPDNYIKGNT INGKLYAFPI LGNSYGQQVL TFNKEYVDKY 
NLDISKVDGS YESATEVLKE FXKXXPNIAA FAIGQTFFAT GNYDFPIGNQ YPFAVKTTDT 
GSPKIINQYA DKDMINNLKV LHQWYKDGLI PTDAATSTTP YDLNTNTWFM RQETQGPMDY 
GDTILTQAAG KPLVSRPLTE PLKTTAQAQM ANYWANTSK NKEKSVELLG LLNSNPELLN 
GLVYGEEGKQ YEKVGDDRVK LLKDYTPTTH LSAWNTGNNL IIWPEESVTE EMVKERDKSI 
EEAKDSPILG FTFVNDKVKT EITNVATVMN RYAASLNTGT VDPEETLPKL MDDLKTAGWD 
KVQKEMQTQL DEYIQSQK 



EFOlO-3 (SEQ ID NO:35) 
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GT GCGGTAAATC TTCAAAAGAT 
GCAGCGTCAA AAGGTGATGA TAGTACACCA 
CCAGATAATT ATGACCAATT AATCGATAAT 
GCAAAATTAA AAATGGAATT TGTTGGTTGG 
GTTGCTTCTG GTGAAAGCTA TGATATTTCA 
AAAGGCGCCT ATGCTGATTT AACTGATTTA 
CAATTGCCAG ATAACTATAT TAAAGGAAAT 
ATTTTAGGTA ACTCTTACGG TCAACAAGTT 
TACAATTTAG ATATTAGTAA AGTCGATGGT 
GAATTCCNTA AAAANGANCC AAATATTGCT 
ACAGGTAATT ATGACTTCCC TATTGGTAAC 
ACTGGCTCAC CAAAAATTAT TAACCAATAT 
GTCTTGCATC AATGGTATAA AGATGGCTTG 
CCATATGACT TAAATACCAA TACTTGGTTT 
TATGGTGATA CAATCTTAAC ACAAGCTGCT 
GAACCATTAA AAACAACAGC TCAAGCGCAA 
AAAAACAAAG AAAAATCTGT TGAATTGTTA 
AACGGACTTG TTTATGGTGA AGAAGGCAAA 
AAATTGTTGA AAGATTACAC ACCAACAACT 
TTAATCATTT GGCCAGAAGA ATCTGTCACT 
ATCGAAGAAG CAAAAGATTC ACCAATTCTT 
ACTGAAATCA CTAACGTTGC TACAGTTATG 
ACTGTTGATC CAGAAGAAAC ACTTCCAAAA 
GATAAAGTTC AAAAAGAAAT GCAAACACAA 



ACGTTATTAA TGTATCGTGT TGGGGACAAA 
GCGAATAAAA TTATCGAGAA AAAAATTGGG 
GGCGATTGGG ACCAAAAAAT GTCAACAATC 
TTAGCACAAA ATTATGCAAC GAATGCACAA 
GCACCTAAAT ATGCCAAAGA AGCCTATGAT 
ACGATTAATG GAAAACTGTA TGCGTTCCCA 
TTAACTTTTA ATAAAGAATA TGTCGATAAA 
AGTTATGAAA GTGCAACGGA AGTTCTAAAA 
GCTTTTGCTA TCGGCCAAAC ATTCTTTGCA 
CAATATCCAT TTGCAGTAAA AACAACTGAT 
GCCGACAAAG ACATGATTAA TAACTTAAAA 
ATTCCAACAG ATGCTGCTAC AAGTACAACA 
ATGCGTCAAG AAACACAAGG ACCTATGGAT 
GGCAAACCAC TTGTTTCTCG TCCACTAACA 
ATGGCTAACT ATGTTGTTGC AAACACGTCT 
GGTTTATTAA ACAGCAATCC AGAATTGTTA 
CAATATGAAA AAGTTGGCGA TGATCGTGTG 
CATTTGAGTG CTTGGAACAC AGGAAACAAC 
GAAGAAATGG TTAAAGAACG TGATAAGAGC 
GGTTTTACTT TTGTAAATGA TAAAGTGAAA 
AACCGTTACG CAGCAAGCTT AAATACAGGA 
TTAATGGATG ACCTAAAAAC AGCTGGCTGG 
TTAGACGAAT ATATCCAATC TCAAAAA 



EFOlO-4 (SEQ ID NO:36) 

CGKSSKDA ASKGDDSTPT LLMYRVGDKP 
DNYDQLIDNA NKIIEKKIGA KLKMEFVGWG 
GAYADLTDLA PKYAKEAYDQ LPDNyIKG^rT 
NLDISKVDGS YESATEVLKE FXKXXPNIAA 
GSPKIINQYA DKDMINNLKV LHQWYKDGLI 
GDTILTQAAG KPLVSRPLTE PLKTTAQAQM 
GLVYGEEGKQ YEKVGDDRVK LLKDYTPTTH 
EEAKDSPILG FTFVNDKVKT EITNVATVMN 
KVQKEMQTQL DEYIQSQK 



DWDQKMSTIV ASGESYDISL AQNYATNAQK 
INGKLYAFPI LGNSYGQQVL TFNKEYVDKY 
FAIGQTFFAT GNYDFPIGNQ YPFAVKTTDT 
PTDAATSTTP YDLNFTNTWFM RQETQGPMDY 
ANYWANTSK NKEKSVELLG LLNSNPELLN 
LSAWNTGNNL IIWPEESVTE EMVKERDKSI 
RYAASLNTGT VDPEETLPKL MDDLKTAGWD 



EFOll-1 (SEQ ID NO:37) 

TAACGTTTTT GGAGGAAAAG AATGAAAAAG 
ATGGGACTGT TAATGTTAAG TGCTTGTCAA 
ACAGAAACAA CAGCTAAAAC GGAAGTCACA 
CCCAAAAATC CTAAGAAAGT CGTTGTTTTT 
CTAGGTGTCG GTGACCGCGT GGTAGGTGCG 
AAATACCAAA AAGTTGAATC AGCAGGCGGC 
CAACTAAAAC CAGACTTAAT TATTATTTCT 
AAAGCCATTG CGCCAACCAT TTACTTAGCT 
AAACAAAATA TCGAAACGTT AGGCACTATT 
ATAACTGGCT TAGAAAAAGA AATTGCTGAC 
AATGCGCTTG TTGTGTTAGT TAACGAAGGA 
TTCGGTTTAA TTCATGATAC ATTTGGCTTC 



AAATTTTTAG CAATGATGGC AGTTTCAATG 
ACAAATAAAA AAACAGCAGA TTCTGCAACA 
GTCAAAGACA CCAATGGTCA ATTAACCX3TT 
GATAATGGTT CCTTGGATAC AATGGATGCA 
CCAACTAAAA ATATCCCTGC GTATTTGAAA 
ATTAAAGAAC CAGATTTAGA AAAAATCAAT 
GGTCGTCAAC AAGATTATCA AGAACAATTA 
GTAGATGCCA AAAATCCTTG GGCATCAACG 
TTTGATAAAG AAGAGGTAGC TAAAGAAAAA 
GTGAAAAAAC AAGCAGAAGC TAGCGCGAAT 
CAACTTTCCG CTTACGGAAA AGGCTCTCGT 
AAAGCAGCAG AC<3ATAAGAT TGAAGCTTCC 
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ACTCATGGGC AAAGTGTTTC TTACGAATAT GTTTTAGAAA AAAATCCTGG GATTCTCTTT 
GTGGTAGATC GCACCAAAGC AATTGGTGGC GACGATTCAA AAGATAACGT CGCTGCAAAC 
GAATTGATTC AAAAAACCGA TGCTGGTAAA AATGATAAAG TCATTATGCT TCAACCAGAT 
GTTTGGTATC TAAGCGGTGG TGGATTAGAA TCAATGCATT TGATGATAGA AGATGTTAAA 
AAAGGATTAG AGTAA 

EFOll-2 (SEQ ID NO:38) 

MKKK FLAMMAVSMM GLLMLSACQT NKKTADSATT ETTAKTEVTV KDTNGQLTVP 
KNPKKVWFD NGSLDTMDAL GVGDRWGAP TKNIPAYLKK YQKVESAGGI KEPDLEKINQ 
LKPDLIIISG RQQDYQEQLK AIAPTIYLAV DAKNPWASTK QNIETLGTIF DKEEVAKEKI 
TGLEKEIADV KKQAEASANN ALWLVNEGQ LSAYGKGSRF GLIHDTFGFK AADDKIEAST 
HGQSVSYEYV LEKNPGILFV VDRTKAIGGD DSKDNVAANE LIQKTDAGKN DKVIMLQPDV 
WYLSGGGLES MHLMIEDVKK GLE 



EFOll-3 (SEQ ID NO:39) 



TTGTCAA ACAAATAAAA AAACAGCAGA TT< 
ACAGAAACAA CAGCTAAAAC GGAAGTCACA 
CCCAAAAATC CTAAGAAAGT CGTTGTTTTT 
CTAGGTGTCG GTGACCGCGT GGTAGGTGCG 
AAATACCAAA AAGTTGAATC AGCAGGCGGC 
CAACTAAAAC CAGACTTAAT TATTATTTCT 
AAAGCCATTG CGCCAACCAT TTACTTAGCT 
AAACAAAATA TCGAAACGTT AGGCACTATT 
ATAACTGGCT TAGAAAAAGA AATTGCTGAC 
AATGCGCTTG TTGTGTTAGT TAACGAAGGA 
TTCGGTTTAA TTCATGATAC ATTTGGCTTC 
ACTCATGGGC AAAGTGTTTC TTACGAATAT 
GTGGTAGATC GCACCAAAGC AATTGGTGGC 
GAATTGATTC AAAAAACCGA TGCTGGTAAA 
GTTTGGTATC TAAGCGGTGG TGGATTAGAA 
AAAGGATTAG AG 



GTCAAAGACA CCAATGGTCA ATTAACCGTT 
GATAATGGTT CCTTGGATAC AATGGATGCA 
CCAACTAAAA ATATCCCTGC GTATTTGAAA 
ATTAAAGAAC CAGATTTAGA AAAAATCAAT 
GGTCGTCAAC AAGATTATCA AGAACAATTA 
GTAGATGCCA AAAATCCTTG GGCATCAACG 
TTTGATAAAG AAGAGGTAGC TAAAGAAAAA 
GTGAAAAAAC AAGCAGAAGC TAGCGCGAAT 
CAACTTTCCG CTTACGGAAA AGGCTCTCGT 
AAAGCAGCAG ACGATAAGAT TGAAGCTTCC 
GTTTTAGAAA AAAATCCTGG GATTCTCTTT 
GACGATTCAA AAGATAACGT CGCTGCAAAC 
AATGATAAAG TCATTATGCT TCAACCAGAT 
TCAATGCATT TGATGATAGA AGATGTTAAA 



EFOll-4 (SEQ ID NO:40) 

CQT NKKTADSATT ETTAKTEVTV KDTNGQLTVP 

KNPKKVWFD NGSLDTMDAL GVGDRWGAP TKNIPAYLKK YQKVESAGGI KEPDLEKINQ 
LKPDLIIISG RQQDYQEQLK AIAPTIYLAV DAKNPWASTK QNIETLGTIF DKEEVAKEKI 
TGLEKEIADV KKQAEASANN ALWLVNEGQ LSAYGKGSRF GLIHDTFGFK AADDKIEAST 
HGQSVSYEYV LEKNPGILFV VDRTKAIGGD DSKDNVAANE LIQKTDAGKN DKVIMLQPDV 
WYLSGGGLES MHLMIEDVKK GLE 



EF012-1 (SEQ ID NO: 41) 

TGAGGGGGCA ACAACATGAA ATTGGGGAAA 
CTTTTAGCCG CATGTGGCGG AACCAAAGAA 
GCAGCTGAAC AAAAAATCAG TATTAGTTCA 
CAAACAACAG ATAAAAATAC CTTTACAATG 
TTTGATGATG ATAGTGCCAC GGTGCCAGCT 



AAAGTAGTAG GTTTGATTGC AACAGGGTTT 
GCGGCAGAGA AAGTAGATTC GGGAAATTTA 
CCTGCACCAA TCTCAACATT GGATACAACA 
GCACAACATT TATTTGAAGG CCTTTATCGG 
CTAGCTAAAG ATGTCAAGAT TAGTGACGAT 
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GGGCGCAAGT ACCACTTTAC CTTGCGGGAG GGGATTAAGT GGAGCAACGG CGAGCCAATC 
ACGGCCCAAG ATTTTGTTTA TTCTTGGAAA AAACTGGTGA CACCAGCGAC GATTGGACCG 
AATGCCTATT TACTAGACAG TGTTAAAAAT AGTTTTGAAA TACGCAACGG TGAAAAGTCA 
GTCGATGAAT TAGGGATTTC AGCCCCGAAT GACAAAGAAT TCATTGTTGA ATTAAAACAG 
GCCCAACCTT CCTTCTTAGC AGTCGTTTCG ATTGCTTGGT TAGCGCCACA AAATCAAAAA 
TTTGTCGAAG CGCAAGGCAA AGATTACGCC TrGGATAGTC AACATTTACT TTATAGCGGG 
CCATTTACGC TAGCCAATTG GGATGCGACT TCAGATACTT GGACATTGAA AAAAAATCCA 
GAATACTATG ATGCGGATCA AGTGAAACTG GAAGAAGTTG CGGTTAGCAC AATCAAAGAA 
GATAATACTG GGATTAACTT ATATCAAGTG AATGAACTAG ACTTAGTTCG CATTAAGGGA 
CAATATGTTC AACAATATCA AGATGATCCA GGCTATGTCA GTCATCCAGA TGTGGCCAAC 
TACTTCTTAG ATTTCAACAA AAAAGAAGGA ACGCCATTAG CGAATGTTCA TTTACGAAAA 
GCGATTGGCC AAGCAATTGA TAAAGAAGCC TTAACACAAA GTCTCTTAAA CGATGGGTCA 
AAACCCCTTA ACGGATTGAT TCCAAGTAAA CTTTATGCGA ATCCAGAAAC GGATGAAGAT 
TTCCGAGCTT ACAGTGGCGA ATATTTGAAA AATGACGTCA AAAAAGCTCA AGCTGAATGG 
ACGAAAGCCC AAGCGGATGT CGGTAAAAAA GTGAAACTTT CATTGCTGGC GGCAGACACA 
GATCAAGGAA AACGAATTGC TGAATATGTT CAAAGTCAGT TGCAAGAAAA TCTGCCAGGT 
TTAGAAATTA CCATTTCATC GCAACCAAGT AATAATGTGA ACCAATCGCG ACGTGAAAAA 
AATTATGAGT TGTCTCTTTC AGGATGGATT GCCGGCAGTA GTGAATTAGA CTCTTACTTT 
AACTTATATG CAGGAGAATC AAGTTACAAT TACGGCAATT ATCATAATGC CAAATACGAC 
CAATTGGTAG AAGAGGCACG AACGATTAAT GCCAATAATC CAGAGAAACA GTTTGCAGAA 
TACAAAGAAG CGGAAGACAT CTTGTTGAAC CAAGATGCTG CCCAAGTACC GCTGTATCAA 
AGTGCCTCAA ATTATCTAAT CAATCCTAAA TTGAAAGGCA TTAGTTATCA CTTGTATGGG 
GATTATTTCC ACTTGCGCAA TGCCTATTTA ACAGAATGA 

EF012-2 (SEQ ID NO:42) 

MKU3KK WGLIATGFL LAACGGTKEA AEKVDSGNLA AEQKISISSP APISTLDTTQ 
TTDKNTFTMA QHLFEGLYRF DDDSATVPAL AKDVKISDDG RKYHFTLREG IKWSNGEPIT 
AQDFVYSWKK LVTPATIGPN AYLLDSVKNS FEIRNGEKSV DELGISAPND KEFIVELKQA 
QPSFLAWSI AWLAPQNQKF VEAQGKDYAL DSEHLLYSGP FTLANWDATS DTWTLKKNPE 
YYDADQVKLE EVAVSTIKED NTGINLYQVN ELDLVRINGQ YVQQYQDDPG YVSHPDVANY 
FLDFNKKEGT PLANVHLRKA IGQAIDKEAL TQSVLNDGSK PLNGLIPSKL YANPETDEDF 
RAYSGEYLKN DVKKAQAEWT KAQADVGKKV KLSLLAADTD QGKRIAEYVQ SQLQENLPGL 
EITISSQPSN NVNQSRREKN YELSLSGWIA GSSELDSYFN LYAGESSYNY GNYHNAKYDQ 
LVEEARTINA NNPEKQFAEY KEAEDILLNQ DAAQVPLYQS ASNYLINPKL KGISYHLYGD 
YFHLRNAYLT E 



EF012-3 (SEQ ID NO: 43) 

ATGTGGCGG AACCAAAGAA GCGGCAGAGA AAGTAGATTC GGGAAATTTA 
GCAGCTGAAC AAAAAATCAG TATTAGTTCA CCTGCACCAA TCTCAACATT GGATACAACA 
CAAACAACAG ATAAAAATAC CTTTACAATG GCACAACATT TATTTGAAGG CCTTTATCGG 
TTTGATGATG ATAGTGCCAC GGTGCCAGCT CTAGCTAAAG ATGTCAAGAT TAGTGACGAT 
GGGCGCAAGT ACCACTTTAC CTTGCGGGAG GGGATTAAGT GGAGCAACGG CGAGCCAATC 
ACGGCCCAAG ATTTTGTTTA TTCTTGGAAA AAACTGGTGA CACCAGCGAC GATTGGACCG 
AATGCCTATT TACTAGACAG TGTTAAAAAT AGTTTTGAAA TACGCAACGG TGAAAAGTCA 
GTCGATGAAT TAGGGATTTC AGCCCCGAAT GACAAAGAAT TCATTGTTGA ATTAAAACAG 
GCCCAACCTT CCTTCTTAGC AGTCGTTTCG ATTGCTTGGT TAGCGCCACA AAATCAAAAA 
TTTGTCGAAG CGCAAGGCAA AGATTACGCC TTGGATAGTG AACATTTACT TTATAGCGGG 
CCATTTACGC TAGCCAATTG GGATGCGACT TCAGATACTT GGACATTGAA AAAAAATCCA 
GAATACTATG ATGCGGATCA AGTGAAACTG GAAGAAGTTG CGGTTAGCAC AATCAAAGAA 
GATAATACTG GGATTAACTT ATATCAAGTG AATGAACTAG ACTTAGTTCG CATTAACGGA 
CAATATGTTC AACAATATCA AGATGATCCA GGCTATGTCA GTCATCCAGA TGTGGCCAAC 
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TACTTCTTAG ATTTCAACAA AAAAGAAGGA ACGCCATTAG CGAATGTTCA TTTACGAAAA 
GCGATTGGCC AAGCAATTGA TAAAGAAGCC TTAACACAAA GTGTCTTAAA CGATGGGTCA 
AAACCCCTTA ACGGATTGAT TCCAAGTAAA CTTTATGCGA ATCCAGAAAC GGATGAAGAT 
TTCCGAGCTT ACAGTGGCGA ATATTTGAAA AATGACGTCA AAAAAGCTCA AGCTGAATGG 
ACGAAAGCCC AAGCGGATGT CGGTAAAAAA GTGAAACTTT CATTGCTGGC GGCAGACACA 
GATCAAGGAA AACGAATTGC TGAATATGTT CAAAGTCAGT TGCAAGAAAA TCTGCCAGGT 
TTAGAAATTA CCATTTCATC GCAACCAAGT AATAATGTGA ACCAATCGCG ACGTGAAAAA 
AATTATGAGT TGTCTCTTTC AGGATGGATT GCCGGCAGTA GTGAATTAGA CTCTTACTTT 
AACTTATATG CAGGAGAATC AAGTTACAAT TACGGCAATT ATCATAATGC CAAATACGAC 
CAATTGGTAG AAGAGGCACG AACGATTAAT GCCAATAATC CAGAGAAACA GTTTGCAGAA 
TACAAAGAAG CGGAAGACAT CTTGTTGAAC CAAGATGCTG CCCAAGTACC GCTGTATCAA 
AGTGCCTCAA ATTATCTAAT CAATCCTAAA TTGAAAGGCA TTAGTTATCA CTTGTATGGG 
GATTATTTCC ACTTGCGCAA TGCCTATTTA ACAGAA 



EF012-4 (SEQ ID NO: 44) 

CGGTKEA AEKVDSGNLA AEQKISISSP APISTLDTTQ 

TTDKNTFTMA QHLFEGLYRF DDDSATVPAL AKDVKISDDG RKYHFTLREG IKWSNGEPIT 
AQDFVYSWKK LVTPATIGPN AYLLDSVKNS FEIRNGEKSV DELGISAPND KEFIVELKQA 
QPSFLAWSI AWLAPQNQKF VEAQGKDYAL DSEHLLYSGP FTLANWDATS DTWTLKKNPE 
YYDADQVKLE EVAVSTIKED NTGINLYQVN ELDLVRINGQ YVQQYQDDPG YVSHPDVANY 
FLDFNKKEGT PLANVHLRKA IGQAIDKEAL TQSVLNDGSK PLNGLIPSKL YANPETDEDF 
RAYSGEYLKN DVKKAQAEWT KAQADVGKKV KLSLLAADTD QGKRIAEYVQ SQLQENLPGL 
EITISSQPSN NVNQSRREKN YELSLSGWIA GSSELDSYFN LYAGESSYNY GNYHNAKYDQ 
LVEEARTINA NNPEKQFAEY KEAEDILLNQ DAAQVPLYQS ASNYLINPKL KGISYHLYGD 
YFHLRNAYLT E 



EF013-1 (SEQ ID NO:45) 

TAACGAAAAA TGAAAAAAAT TGCTTTGTTC AGTATGTTAA CGTTCAGTGT ATTGTCTTTA 
AGTCTAGCAG GATGTGGAAA CAAAAAAACA GCAAGCACAA ATGATTCTAA GCCAAAGCAA 
GAAACAAAGA AAGCCACGCA GAAATCCTCT AGCCAACAAG AAATGAAAAG TAGTCATTCG 
TCTGTCACGG GTCAAAATTC TAATGTGACA GGGGAAAATC CGTCAGAAAA TGCCACGCAG 
CCTTCTGCAG GAACTGATGA AACGAATGAA GTCCCTCAAA ACCAAGCACC TGATACAAAC 
ATTACAATTA CCAATGTTGT TTTCAATCCT GAAAGAAATG AAATTAATGG TACTACATTA 
CCTAATGCAA CCATTACAGC AACGGTAGTC GGTGATGCTT CTGCACAAGC AGGTGTTTTT 
TATGCGGATG CCAATGGCAA TTTTACAGTA ATTAGTCCCA GAGCGGGAGC GACTACTCAA 
TTAATCGCAA CCGTTGATCA ACGGAATAGT GCACCTGTCC AAATTGATAT TCCAAGTTCA 
GGACAAGAAG CAGCGCTTTC TTTTAGCAAT ATTACGATTG ATCCGAAACA AGGGACAATT 
TCTGGTAAAA CAGCACCGAA TGCAACTATT TTAGTGTCAC GTGCAGATGA TGCGCGGGTG 
ATTTTAGCAA GTTTTACTGC GGATGCCCAA GGGAATTTCA CAGCCAGTAA TTTAGTTCCC 
GGCACAAAAA ATCGCTTAGA TGTTACGTTA AATGGAGAAA TAGGGACACC TTACTTGTTT 
GATTTACCAA ATTAA 

EF013-2 (SEQ ID NO:46) 

MKKIALFS MLTFSVLSLS LAGCGNKKTA STNDSKPKQE TKKATQKSSS QQEMKSSHSS 
VTGQNSNVTG ENPSENATQP SAGTDETNEV PQNQAPDTNI TITNWFNPE RNEINGTTLP 
NATITATWG DASAQAGVFY ADANGNFTVI SPRAGATTQL lATVDQRNSA PVQIDIPSSG 
QEAALSFSNI TIDPKQGTIS GKTAPNATIL VSRADDARVI LASFTADAQG NFTASNLVPG 
TKNRLDVTLN GEIGTPYLFD LPN 
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EF013-3 (SEQ ID NO:47) 

ATGTGGAAA CAAAAAAACA GCAAGCACAA ATGATTCTAA GCCAAAGCAA 
GAAACAAAGA AAGCCACGCA GAAATCCTCT AGCCAACAAG AAATGAAAAG TAGTCATTCG 
TCTGTCACGG GTCAAAATTC TAATGTGACA GGGGAAAATC CGTCAGAAAA TGCCACGCAG 
CCTTCTGCAG GAACTGAa?GA AACGAATGAA GTCCCTCAAA ACCAAGCACC TGATACAAAC 
ATTACAATTA CCAATGTTGT TTTCAATCCT GAAAGAAATG AAATTAATGG TACTACATTA 
CCTAATGCAA CCATTACAGC AACGGTAGTC GGTGATGCTT CTGCACAAGC AGGTGTTTTT 
TATGCGGATG CCAATGGCAA TTTTACAGTA ATTAGTCCCA GAGCGGGAGC GACTACTCAA 
TTAATCGCAA CCGTTGATCA ACGGAATAGT GCACCTGTCC AAATTGATAT TCCAAGTTCA 
GGACAAGAAG CAGCGCTTTC TTTTAGCAAT ATTACGATTG ATCCGAAACA AGGGACAATT 
TCTGGTAAAA CAGCACCGAA TGCAACTATT TTAGTGTCAC GTGCAGATGA TGCGCGGGTG 
ATTTTAGCAA GTTTTACTGC GGATGCCCAA GGGAATTTCA CAGCCAGTAA TTTAGTTCCC 
GGCACAAAAA ATCGCTTAGA TGTTACGTTA AATGGAGAAA TAGGGACACC TTACTTGTTT 
GATTTACCAA AT 

EF013-4 (SEQ ID NO:48) 



CGNKKTA STNDSKPKQE TKKATQKSSS QQEMKSSHSS 

VTGQNSNVTG ENPSENATQP SAGTDETNEV PQNQAPDTNI TITNWFNPE RNEINGTTLP 
NATITATWG DASAQAGVFY ADANGNFTVI SPRAGATTQL lATVDQRNSA PVQIDIPSSG 
QEAALSFSNI TIDPKQGTIS GKTAPNATIL VSRADDARVI LASFTADAQG NFTASNLVPG 
TKNRLDVTLN GEIGTPYLFD LPN 

EF014-l'(SEQ ID NO:49) 



TGATGGTGGA GACTTTTTAA GAGAGAGGAA 
AGCTTAATTA GTTTAGTCAT CATTTTGGTT 
GTAGCGGGTA GCTATTTAAA GAAAACAATT 
TATAATGAAG CGCAAAATAA AGATAGTCAA 
ATTGAACGGA AATTAGGCAC AACTAGGACT 
AAGACGAAGA AAATAACCTA TTTAAGTTTG 
AAAAATTACC AAGGGATGCA GCGAATTGAA 
TCTGTTAACA CAGTTGAGAA ATTATTGAAT 
TTTTTATCTT TTATTAAGTT AATTGATGCG 
GCGTTTGATG GTGTCACCAA AGACGGGCCA 
CATTTAGATG GTACGAAAGC TTTATCTTAT 
ATGCGTGGAT TCCGACAACA AGAAATTATT 
CAATCAATCA TGAAAATAAT GGACATTATT 
GTGGATTCCA ATGAATTGAC TCATTTAGTC 
AAACAACAGC TTTCTTTTGA CTGGCGCACT 
CTATACCCAG ATAGTATTGA AAATGTCCGT 
AAGCCAGATG AACGAGATCA AGACGGCTAT 
CAAAGTGATT ATACCGTTCA AGATGAAGCA 
GGCAATACGT ATATTGGTGT TCCTGGTAAT 
ACGGAAAATG GCTTTATAAA ATAA 



GTACAGCCAA TGAGTAGGAA GCGAAAAATC 
TTTGTCACAG TCGGCTCAGC ATACTTTGCT 
GATAAAGGCT ATGTTCCCAT AAAAAATGAT 
TCGTTTTTGA TTATGGGGCT AGACAATACA 
GATGCTATGA TGGTGATTAC CGTGAATAAC 
CCACGGGATA GTTTTGTTCA AATTGATGCG 
GCCGCCTATA CCTACGATGG ACCAACAGCT 
ATTCCAATCA ATCATTACGT TGTGTTTAAC 
GTTGGCGGCA TAGATGTCAA TGTCAAGCAG 
GGATCCATTC ATTTTGATGC AGGGAAACAG 
GCCCGTGAAA GACATAGCGA TAACGATATT 
CAAGCAGTTG AAGACAAGTT GAAATCTGGT 
GATTCGTTAA ATGGAAACAT TCAAACTGAT 
AAAGAAGGTT TGACTTGGAC CAATTATGAT 
TTTAGTAATG AAGGGCGCAG TATGGTTGAA 
CATCAATTAC GTGTGTCTTT AAATTTAGAA 
GTCTTCCATA CGAACGGTGA ATTTTTATAT 
GCTGAGGAAA ACGAAATGAC TTCCATCAAC 
ACACAGACCG GCCCGTTGCC ATCAGTTAAA 



EF014-2 (SEQ ID NO:50) 

MSRKRKIS LISLVIILVF VTVGSAYFAV AGSYLKKTID KGYVPIKNDY 
NEAQNKDSQS FLIMGLDNTI ERKLGTTRTD AMMVITVNNK TKKITYLSLP RDSFVQIDAK 
NYQGMQRIEA AYTYDGPTAS VNTVEKLLNI PINHYWFNF LSFIKLIDAV GGIDVNVKQA 
FDGVTKDGPG SIHFDAGKQH LDGTKALSYA RERHSDNDIM RGFRQQEIIQ AVEDKLKSGQ 
SIMKIMDIID SLNGNIQTDV DSNELTHLVK EGLTWTNYDK QQLSFDWRTF SNEGRSMVEL 
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YPDSIENVRH QLRVSLNLEK PDERDQDGYV FHTNGEFLYQ SDYTVQDEAA EENEMTSING 
NTYIGVPGNT QTGPLPSVKT ENGFIK 

EF014-3 (SEQ ID NO; 51) 

TGCT 

GTAGCGGGTA GCTATTTAAA GAAAACAATT GATAAAGGCT ATGTTCCCAT AAAAAATGAT 
TATAATGAAG CGCAAAATAA AGATAGTCAA TCGTTTTTGA TTATGGGGCT AGACAATACA 
ATTGAACGGA AATTAGGCAC AACTAGGACT GATGCTATGA TGGTGATTAC CGTGAATAAC 
AAGACGAAGA AAATAACCTA TTTAAGTTTG CCACGGGATA GTTTTGTTCA AATTGATGCG 
AAAAATTACC AAGGGATGCA GCGAATTGAA GCCGCCTATA CCTACGATGG ACCAACAGCT 
TCTGTTAACA CAGTTGAGAA ATTATTGAAT ATTCCAATCA ATCATTACGT TGTGTTTAAC 
TTTTTATCTT ' TTATTAAGTT AATTGATGCG GTTGGCGGCA TAGATGTCAA TGTCAAGCAG 
GCGTTTGATG GTGTCACCAA AGACGGGCCA GGATCCATTC ATTTTGATGC AGGGAAACAG 
CATTTAGATG GTACGAAAGC TTTATCTTAT GCCCGTGAAA GACATAGCGA TAACGATATT 
ATGCGTGGAT TCCGACAACA AGAAATTATT CAAGCAGTTG AAGACAAGTT GAAATCTGGT 
CAATCAATCA TGAAAATAAT GGACATTATT GATTCGTTAA ATGGAAACAT TCAAACTGAT 
GTGGATTCCA ATGAATTGAC TCATTTAGTC AAAGAAGGTT TGACTTGGAC CAATTATGAT 
AAACAACAGC TTTCTTTTGA CTGGCGCACT TTTAGTAATG AAGGGCGCAG TATGGTTGAA 
CTATACCCAG ATAGTATTGA AAATGTCCGT CATCAATTAC GTCTCTCTTT AAATTTAGAA 
AAGCCAGATG AACGAGATCA AGACGGCTAT GTCTTCCATA CGAACGGTGA ATTTTTATAT 
CAAAGTGATT ATACCGTTCA AGATGAAGCA GCTGAGGAAA ACGAAATGAC TTCCATCAAC 
GGCAATACGT ATATTGGTGT TCCTGGTAAT ACACAGACCG GCCCGTTGCC ATCAGTTAAA 
ACGGAAAATG GCTTTATAAA A 



EF014-4 (SEQ ID NO: 52) 
AV AGSYLKKTID KGYVPIKNDY 

NEAQNKDSQS FLIMGLDNTI ERKLGTTRTD AMMVITVNNK TKKITYLSLP RDSFVQIDAK 
NYQGMQRIEA AYTYDGPTAS VNTVEKLLNI PINHYWFNF LSFIKLIDAV GGIDVNVKQA 
FDGVTKDGPG SIHFDAGKQH LDGTKALSYA RERHSDNDIM RGFRQQEIIQ AVEDKLKSGQ 
SIMKIMDIID SLNGNIQTDV DSNELTHLVK EGLTWTNYDK QQLSFDWRTF SNEGRSMVEL 
YPDSIENVRH QLRVSLNLEK PDERDQDGYV FHTNGEFLYQ SDYTVQDEAA EENEMTSING 
NTYIGVPGNT QTGPLPSVKT ENGFIK 

EF015-1 (SEQ ID NO: 53) 

TAATTAAAAA TGTGTAAAAA GGGTCTGATG AAAAAAGGAG ACATAATAGT TATTATCTTT 
TTAATAGCTA TCTCTTTTTC TCCATATTTT ATTTTTTTTC ACAATAATCC ATTTAACTCC 
AAAAGTTTTG ACGACACTAA ATATGCTGTG GTCAAGATAG ATGGGAAAGA GATTGAGCGT 
ATAAATTTAG ATGATTCAAA AGAATTTATC AAAACATATT ATCCATCAAA AGGGCAATAT 
AATACTATAG AAGTTAAAAA TGGGCACGTT CGTGTAAAAA AAGATAATAG TCCAGATCAA 
ATTGCGGTGA AAACAGGATG GATATCAGAA CCAGGGCNAA CTAGTATCTG TATTCCTCAC 
AGATTCATTT TAGAAATTGT TCAACAATAT TCTAAGGATT ATTATATTTA CTAA 

EF015-2 (SEQ ID NO: 54) 

MK KGDIIVIIFL lAISFSPYFI FFHNNPFNSK SFDDTKYAW KIDGKEIERI 
NLDDSKEFIK TYYPSKGQYN TIEVKNGHVR VKKDNSPDQI AVKTGWISEP GXTSICIPHR 
FILEIVQQYS KDYYIY 

EF015-3 (SEQ ID NO: 55) 
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CAATAATCC ATTTAACTCC 

AAAAGTTTTG ACGACACTAA ATATGCTGTG GTCAAGATAG ATGGGAAAGA GATTGAGCGT 
ATAAATTTAG ATGATTCAAA AGAATTTATC AAAACATATT ATCCATCAAA AGGGCAATAT 
AATACTATAG AAGTTAAAAA TGGGCACGTT CGTGTAAAAA AAGATAATAG TCCAGATCAA 
ATTGCGGTGA AAACAGGATG GATATCAGAA CCAGGGCNAA CTAGTATCTG TATTCCTCAC 
AGATTCATTT TAGAAATTGT TCAACAATAT TCTAAGGATT ATTATATTTA C 



EF015-4 (SEQ ID NO:56) 

NNPFNSK SFDDTKYAW KIDGKEIERI 
NLDDSKEFIK TYYPSKGQYN TIEVKNGHVR 
FILEIVQQYS KDYYIY 



VKKDNSPDQI AVKTGWISEP GXTSICIPHR 



EF016-1 (SEQ ID NO:57) 

TGACGGTTGC CCCCGTCCAA TAGAAAGGAG 
TTGCTGGTTA TCTGTTGTAG TTTACTCCTA 
GAAGATCAAT GGACACGGAT TAACGAAGAA 
TTTGTGCCCA TGGGTTTTCA AGATAAATCA 
GCCAAAGCGG TTTTTAAACT TTATGGCATT 
ATGAAAGAAA CAGAATTACA AAATCAAACC 
ACGAGCGAGC GGGCCGAAAA AGTTCAATTC 
CTTGTTTCTT TAAAAGAAAA AAACATTGCA 
GGGGTTCAAA ACGGCTCTTC TGGCTATGAT 
AAATTTGTTA AAGACCAAAC ACCTATTTTA 
TTAAAATCTG GTCGAATTGA CGGACTCCTA 
TCCCACGAAG ATAATTTAAA AAACTATACT 
TTTGCTGTGG GCGTCCGCAA ATCAGACAAT 
GAAACGTTAC GAAAAGATGG CACCCTTAGT 
GTTACAAATA ACACAAAAAT AAACTAA 



TTTATGATGA AAAAGAAATA TTCTTTAGCC 
TTTGCAGGTT GTGGTAAAAG AAAAAGCAAC 
AAACGGATTA TTATTGGCTT AGATGACTCC 
GGCAAAATTG TCGGCTTTGA TGTCGACTTA 
TCCGTTGACT TCCAACCGAT TGATTGGTCT 
ATTGATCTTA TTTGGAACGG CTACACTAAA 
ACACAACCTT ACATGACGAA CGACCAAGTA 
ACAGCGAGCG ACATGCAAGG CAAAATTTTA 
GGCTTCGAAA GTCAGCCTGA CGTTTTGAAA 
TATGACGGCT TTAATGAAGC TTTCTTAGAT 
ATCGATCGCG TTTACGCCAA CTACTATCTT 
ATTTCTCATG TAGGCTATGA CAATGAAGAT 
CAATTAGTCC AAAAAATCAA TACTGCCTTT 
AAAATTTCTC AAAAATGGTT TGGAGAGGAC 



EF016-2 (SEQ ID NO: 58) 

MMKKKYSLAL LVICCSLLLF AGCGKRKSNE DQWTRINEEK RIIIGLDDSF 
VPMGFQDKSG KIVGFDVDLA KAVFKLYGIS VDFQPIDWSM KETELQNQTI DLIWNGYTKT 
SERAEKVQFT QPYMTNDQVL VSLKEKNIAT ASDMQGKILG VQNGSSGYDG FESQPDVLKK 
FVKDQTPILY DGFNEAFLDL KSGRIDGLLI DRVYANYYLS HEDNLKNYTI SHVGYDNEDF 
AVGVRKSDNQ LVQKINTAFE TLRKDGTLSK ISQKWFGEDV TNNTKIN 



EF016-3 (SEQ ID NO: 59) 



AAGCAAC 

GAAGATCAAT GGACACGGAT TAACGAAGAA 
TTTGTGCCCA TGGGTTTTCA AGATAAATCA 
GCCAAAGCGG TTTTTAAACT TTATGGCATT 
ATGAAAGAAA CAGAATTACA AAATCAAACC 
ACGAGCGAGC GGGCCGAAAA AGTTCAATTC 
CTTGTTTCTT TAAAAGAAAA AAACATTGCA 
GGGGTTCAAA ACGGCTCTTC TGGCTATGAT 
AAATTTGTTA AAGACCAAAC ACCTATTTTA 
TTAAAATCTG GTCGAATTGA CGGACTCCTA 



AAACGGATTA TTATTGGCTT AGATGACTCC 
GGCAAAATTG TCGGCTTTGA TGTCGACTTA 
TCCGTTGACT TCCAACCGAT TGATTGGTCT 
ATTGATCTTA TTTGGAACGG CTACACTAAA 
ACACAACCTT ACATGACGAA CGACCAAGTA 
ACAGCGAGCG ACATGCAAGG CAAAATTTTA 
GGCTTCGAAA GTCAGCCTGA CGTTTTGAAA 
TATGACGGCT TTAATGAAGC TTTCTTAGAT 
ATCGATCGCG TTTACGCCAA CTACTATCTT 
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TCCCACGAAG ATAATTTAAA AAACTATACT 
TTTGCTGTGG GCGTCCGCAA ATCAGACAAT 
GAAACGTTAC GAAAAGATGG CACCCTTAGT 
GTTACAAATA ACACAAAAAT AAAC 



ATTTCTCATG TAGGCTATGA CAATGAAGAT 
CAATTAGTCC AAAAAATCAA TACTGCCTTT 
AAAATTTCTC AAAAATGGTT TGGAGAGGAC 



EF016-4 (SEQ ID NO: 60) 



SNE DQWTRINEEK RIIIGLDDSF 
VPMGFQDKSG KIVGFDVDLA KAVFKLYGIS 
SERAEKVQFT QPYMTNDQVL VSLKEKNIAT 
FVKDQTPILY DGFNEAFLDL KSGRIDGLLI 
AVGVRKSDNQ LVQKINTAFE TLRKDGTLSK 



VDFQPIDWSM KETELQNQTI DLIWNGYTKT 
ASDMQGKILG VQNGSSGYDG FESQPDVLKK 
DRVYANYYLS HEDNLKNYTI SHVGYDNEDF 
ISQKWFGEDV TNNTKIN 



EF017-1 (SEQ ID NO: 61) 



TGAGGTGTTT TTATGAAAAG GGCAACAAAG 
CTACTTCTCT CGGGCTGTGG AAGTGTTGGG 
TTACGGGTCG GGATTGATTC GGAATTATCA 
ACCGCAGCAG ATGTAATGAG CCAAGTAGGG 
GAAGCGAAAC CAGCATTGGC AACTGAAAAA 
ACTTTTACGA TTCGAAAAGA TGCAAAATGG 
TTTGAATACT CTTGGAAGCG CACAGTGGAC 
TTTGAAGGGT TAAAAAATTA TCGTGCTATT 
GGGGTAACAG CCATTGATGA CCATACCTTG 
TTTCAACAAT TATTGGCGGT ACCAGCTTTT 
ACGGGCAAAA ACTATGGTAC ATCAGCTGAG 
GAAGGTTGGG ATGGCACGAA TAATACTTGG 
CAAGCGAATG TTTCGCTAGA TAAGGTGGAT 
AAAAATCTTT TCGAAGGGAA AGAATTAGAT 
CAAGAACAAG GCAATGCAGC TTTGAAAATT 
TTAAATACGC AAAAAGATCT TTTGGCAAAT 
TTGAATTCTG AGCGTTTAGC TAAAAATGTT 
TTCGTGCCAA CAGGTTTCAC TAATCAAGAA 
GATTTAAATC CTAGTGAACC AGAAAAAGCG 
TTAGGAATTG AAAAAGCGGA GCTAACGATT 
ATCAGTGAGT ATGTTCAAGG AGCTTTAGCA 
TCACCAGTTC CTTTTAATAA TCGTTTAGAA 
GTTGGTGGCT GGACGCCAGT ATATGCTGAT 
AAAAATTCCA ATAATTTTGG TAAATGGTCT 
GCAAACGTAA CTTATGCAAA TAAATATGAA 
CAATTGGTTG CGGAAGAAGC CCCCCTAGTT 
GTGGCCGATT CTGTCCAAAA TTTAGTCTAT 
GTCTCTATCG GCGACAAGTA A 



CAAAGGCTGT CTTTGGCAGC AATCATGGTT 
AAAGAAACCA AAAAGCAAGA ACAACAGGTA 
ACGGCAGACG TGTCGTTGGC AATGGATAAT 
GAGGGACTTT TCTCCTTTGA CGAAAAAGGA 
GTACAGCCCT CCAATGATGG TTTAAGCTAT 
AGTAACGGCG AGCCAATCAC AGCAAATGAT 
CCAAAAACAG CTTCCCCGCA AGCGTATTAC 
GTTGACGGTA GCAAATCTAA AGAAGAGTTA 
GAAGTAGAGC TAAGCTATCC TATGAGTTAT 
TATCCTTTAA ATGAAGCATT TGTCGAAAAA 
TCAACACTTT ACAATGGCGC CTTCACATTA 
TCCTATGTGA AGAATAAAAA TTATTGGGAT 
GTCCAAGTAG TTAAAGAAGT CAATACTGGG 
GTTGTAAAAA TTTCTGGAGA AATTGTTGCA 
CGTGAAATTC CTGGAACGTA TTATATCCAA 
AAGAATGCAC GTCGAGCAAT AGCATTATCA 
TTAAATGATG GCTCAAAAAA AGCACTTGGC 
ACGCAAAAAG ATTTTGCAGA GGAATTAGGA 
AAAGAGTTAT GGCAAACGGC TAAAAAAGAA 
TTAAGTTCGG ATACAGAAAA TGCTAAAAAA 
GATAATTTAG AAAATTTAAC AGTCAATGTT 
AAAAGTCGCA GCGGAGATTT CGACATTGTG 
CCAATCGATT TCTTAAACTT ACTGCAATCA 
AATAAGACCT TTGATCAGTT tSCTTCAAGAA 
GAACGTTGGA AAACATTACA AAAAGCGGAT 
CCTCTTTATC AATTAACAGA AGCACGCTTA 
GGTCCATTAG GTTCAGGCTA TTACAAATCA 



EF017-2 (SEQ ID NO: 62) 

MKRATKQ RLSLAAIMVL LLSGCGSVGK ETKKQEQQVL RVGIDSELST ADVSLAMDNT 
AADVMSQVGE GLFSFDEKGE AKPALATEKV QPSNDGLSYT FTIRKDAKWS NGEPITANDF 
EYSWKRTVDP KTASPQAYYF EGLKNYRAIV DGSKSKEELG VTAIDDHTLE VELSYPMSYF 
QQLU^VPAFY PLNEAFVEKT GKNYGTSAES TLYNGAFTLE GWDGTNNTWS YVKNKNYWDQ 
ANVSLDKVDV QWKEVNTGK NLFEGKELDV VKISGEIVAQ EQGNAALKIR EIPGTYYIQL 
NTQKDLLANK NARRAIALSL NSERLAKNVL NDGSKKALGF VPTGFTNQET QKDFAEELGD 
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LNPSEPEKAK ELWQTAKKEL GIEKAELTIL SSDTENAKKI SEYVQGALAD NLENLTVNVS 
PVPFNNRLEK SRSGDFDIW GGWTPVYADP IDFLNLLQSK NSNNFGKWSN KTFDQLLQEA 
NVTYANKYEE RWKTLQKADQ LVAEEAPLVP LYQLTEARLV ADSVQNLVYG PLGSGYYKSV 
SIGDK 

EF017-3 (SEQ ID NO: 63) 

CTGTGG AAGTGTTGGG AAAGAAACCA AAAAGCAAGA ACAACAGGTA 

TTACGGGTCG GGATTGATTC GGAATTATCA ACGGCAGACG TGTCGTTGGC AATGGATAAT 
ACCGCAGCAG ATGTAATGAG CCAAGTAGGG GAGGGACTTT TCTCCTTTGA CGAAAAAGGA 
GAAGCGAAAC CAGCATTGGC AACTGAAAAA GTACAGCCCT CCAATGATGG TTTAAGCTAT 
ACTTTTACGA TTCGAAAAGA TGCAAAATGG AGTAACGGCG AGCCAATCAC AGCAAATGAT 
TTTGAATACT CTTGGAAGCG CACAGTGGAC CCAAAAACAG CTTCCCCGCA AGCGTATTAC 
TTTGAAGGGT TAAAAAATTA TCGTGCTATT GTTGACGGTA GCAAATCTAA AGAAGAGTTA 
GGGGTAACAG CCATTGATGA CCATACCTTG GAAGTAGAGC TAAGCTATCC TATGAGTTAT 
TTTCAACAAT TATTGGCGGT ACCAGCTTTT TATCCTTTAA ATGAAGCATT TGTCGAAAAA 
ACGGGCAAAA ACTATGGTAC ATCAGCTGAG TCAACACTTT ACAATGGCGC CTTCACATTA 
GAAGGTTGGG ATGGCACGAA TAATACTTGG TCCTATGTGA AGAATAAAAA TTATTGGGAT 
CAAGCGAATG TTTCGCTAGA TAAGGTGGAT GTCCAAGTAG TTAAAGAAGT CAATACTGGG 
AAAAATCTTT. TCGAAGGGAA AGAATTAGAT GTTGTAAAAA TTTCTGGAGA AATTGTTGCA 
CAAGAACAAG GCAATGCAGC TTTGAAAATT CGTGAAATTC CTGGAACGTA TTATATCCAA 
TTAAATACGC AAAAAGATCT TTTGGCAAAT AAGAATGCAC GTCGAGCAAT AGCATTATCA 
TTGAATTCTG AGCGTTTAGC TAAAAATGTT TTAAATGATG GCTCAAAAAA AGCACTTGGC 
TTCGTGCCAA CAGGTTTCAC TAATCAAGAA ACGCAAAAAG ATTTTGCAGA GGAATTAGGA 
GATTTAAATC CTAGTGAACC AGAAAAAGCG AAAGAGTTAT GGCAAACGGC TAAAAAAGAA 
TTAGGAATTG AAAAAGCGGA GCTAACGATT TTAAGTTCGG ATACAGAAAA OXSCTAAAAAA 
ATCAGTGAGT ATGTTCAAGG AGCTTTAGCA GATAATTTAG AAAATTTAAC AGTCAATGTT 
TCACCAGTTC CTTTTAATAA TCGTTTAGAA AAAAGTCGCA GCGGAGATTT CGACATTGTG 
GTTGGTGGCT GGACGCCAGT ATATGCTGAT CCAATCGATT TCTTAAACTT ACTGCAATCA 
AAAAATTCCA ATAATTTTGG TAAATGGTCT AATAAGACCT TTGATCAGTT GCTTCAAGAA 
GCAAACGTAA CTTATGCAAA TAAATATGAA GAACGTTGGA AAACATTACA AAAAGCGGAT 
CAATTGGTTG CGGAAGAAGC CCCCCTAGTT CCTCTTTATC AATTAACAGA AGCACGCTTA 
GTGGCCGATT CTGTCCAAAA TTTAGTCTAT GGTCCATTAG GTTCAGGCTA TTACAAATCA 
GTCTCTATCG GCGACAAG 



EF017-4 (SEQ ID NO: 64) 

CGSVGK ETKKQEQQVL RVGIDSELST ADVSLAMDNT 

AADVMSQVGE GLFSFDEKGE AKPALATEKV QPSNDGLSYT FTIRKDAKWS NGEPITANDF 
EYSWKRTVDP KTASPQAYYF EGLKNYRAIV DGSKSKEELG VTAIDDHTLE VELSYPMSYF 
QQLLAVPAFY PLNEAFVEKT GKNYGTSAES TLYNGAFTLE GWDGTNNTWS YVKNKNYWDQ 
ANVSLDKVDV QWKEVNTGK NLFEGKELDV VKISGEIVAQ EQGNAALKIR EIPGTYYIQL 
NTQKDLLANK NARRAIALSL NSERLAKNVL NDGSKKALGF VPTGFTNQET QKDFAEELGD 
LNPSEPEKAK ELWQTAKKEL GIEKAELTIL SSDTENAKKI SEYVQGALAD NLENLTVNVS 
PVPFNNRLEK SRSGDFDIW GGWTPVYADP IDFLNLLQSK NSNNFGKWSN KTFDQLLQEA 
NVTYANKYEE RWKTLQKADQ LVAEEAPLVP LYQLTEARLV ADSVQNLVYG PLGSGYYKSV 
SIGDK 



EF018-1 (SEQ ID NO: 65) 

TGTCATTACA ACGATACCAA TTTTAATCAT 
CGGTATGATG GCCGGTGCAG TAAAAGAATA 



TTATCCATTA CTACAAAAAC ACTTTATCGG 
AAGAAAGTAG GGAACAATAT GAAAAAAGTT 
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TTAGGCGGTT TATTGGTGGC AACGGCGGTC GTTAGTTTAG CGGCCTGTAG CGGTGGGGAA 
AAGAAAGCTA GCTCAGATGT CTCAATTAAG GATCGGTATG AATTAGATGA AAAGACGCCT 
GCTTGGAAGT TAGATAAGAA GAAAGAACCG ACCAAGATTA AATGGTATAT TAACTCAGAT 
TGGACGGCGC TGCCTTTTGG AAAAGACGTG ACCACTGCGC AGATTAAAAA AGACTTAAAT 
GTGGATATTG AATTTATTTC CGGCGATGAT TCAAAATTAA ATGCCATGAT TTCAAGTGGA 
GATATGCCTG ATATCGTGAC ATTAACTGAA AAAACTGGAC AAGCAGCATT GAAAGCAGAT 
TCTTGGGCCT ATTCTTTAAA CGATTTAGCT AAAAAATATG ACCCCTATTT AATGAAAGTT 
GTTAACCAAG ATACGTTTAA ATGGTATGCC TTAGAGGATG GAAAAACATA TGGTTACCCT 
AATTACTCTA ATACAAAAGC GGATTATGAA AGTGGAAATA TCCCAGTAAA TGATAATTTT 
GTTATTCGTG AAGATGTCTA TAATGCATTA GGCAAGCCAG ACGTTTCAAC ACCAGAAAAT 
TTTGAAAAAG TCATGCAACA GATTAAAGAA AAATATCCTG AGATGACCCC AATGGGCTTC 
ACCACAGTGG GCGATGGTGC AGGACCATTT TTAGACAAAT TACAAGACTT CTTAGGTGTT 
CCTTTAGAGG ATAAAAATGG TAAATACTAT GATCGAAATT TAGATAAAGA ATATTTAGAA 
TGGTTAAAAA CATTTAATGA TGTTTACCGA GCAGGCAATA TTAGTGATGA TAGCTTCACA 
GATGATGGGG CAACGTTTGA TGAAAAAGTG AAACAAGGAA ATTATGCAAC CATGCTCGTT 
GCTGGAACCA GTGGTCAAGG TGGGAACTTC ACAGAATTTA TGAAAAAATC TGGCACACGT 
TATATAGCCA TTGATGGACC AAGTAGCACT TCTGGCCGAA AACCAACATT AAATCAAACC 
GGCATTTCAG GTTGGTTAAG TAATTACATT ACGAAAGATG CGAAAGATCC AGCAAAAGTC 
ACTCAACTGT TCACATATTT AATTGATGAA CCGGGACAAA TTTTAACAAA ATATGGCGTT 
GAAGGAGTTA CTTATGCGTA CAATGATCAA GGAAAAATTG ATTATTTACC AGAAGTGAAA 
AAATTAGAAC AAACAGACAA TGATGCCTAC AACAAAAAAT ATGGCATTAG TCGTTTCCTA 
TACTTTAACA ACGACCGTGT CAATAAACTA AAAGTACCAA TGGAAAGTGC TTTAACGCAA 
ATGCAAGAAT GGGGCAAAGG AAAATTAGTC CCACATTTCG TAATTGAAAA TATTAATCCA 
GATGCAGGAA CGCCGGAAGC TCGTGCGAAT GAAGCGATTG AAACCAAACT AAATACAACC 
GTTATTTCAA TGATTCGTGC GAAAGATGAT AAAGCCTTTG ACAAATCTTT AGAAGACTAC 
AAAGCATTCT TAAAATCAAA TAAATGGGAT GCAATTGAAA AAATAAAATC TGAGAAAATG 
GCGGAAAACA GAGACAAACT TAAGTAA 

EF018-2 (SEQ ID NO: 66) 

MKKV LGGLLVATAV VSLAACSGGE 

KKASSDVSIK DRYELDEKTP AWKLDKKKEP TKIKWYINSD WTALPFGKDV TTAQIKKDLN 
VDIEFISGDD SKLNAMISSG DMPDIVTLTE KTGQAALKAD SWAYSLNDLA KKYDPYLMKV 
VNQDTFKWYA LEDGKTYGYP NYSNTKADYE SGNIPVNDNF VIREDVYNAL GKPDVSTPEN 
FEKVMQQIKE KYPEMTPMGF TTVGDGAGPF LDKLQDFLGV PLEDKISFGKYY DRNLDKEYLE 
WLKTFNDVYR AGNISDDSFT DDGATFDEKV KQGNYATMLV AGTSGQGGNF TEFMKKSGTR 
YIAIDGPSST SGRKPTLNQT GISGWLSNYI TKDAKDPAKV TQLFTYLIDE PGQILTKYGV 
EGVTYAYNDQ GKIDYLPEVK KLEQTDNDAY NKKYGISRFL YFNNDRVNKL KVPMESALTQ 
MQEWGKGKLV PHFVIENINP DAGTPEARAN EAIETKLNTT VISMIRAKDD KAFDKSLEDY 
KAFLKSNKWD AIEKIKSEKM AENRDKLK 

EF018-3 (SEQ ID NO: 67) 

CTGTAG CGGTGGGGAA 

AAGAAAGCTA GCTCAGATGT CTCAATTAAG GATCGGTATG AATTAGATGA AAAGACGCCT 
GCTTGGAAGT TAGATAAGAA GAAAGAACCG ACCAAGATTA AATGGTATAT TAACTCAGAT 
TGGACGGCGC TGCCTTTTGG AAAAGACGTG ACCACTGCGC AGATTAAAAA AGACTTAAAT 
GTGGATATTG AATTTATTTC CGGCGATGAT TCAAAATTAA ATGCCATGAT TTCAAGTGGA 
GATATGCCTG ATATCGTGAC ATTAACTGAA AAAACTGGAC AAGCAGCATT GAAAGCAGAT 
TCTTGGGCCT ATTCTTTAAA CGATTTAGCT AAAAAATATG ACCCCTATTT AATGAAAGTT 
GTTAACCAAG ATACGTTTAA ATGGTATGCC TTAGAGGATG GAAAAACATA TGGTTACCCT 
AATTACTCTA ATACAAAAGC GGATTATGAA AGTGGAAATA TCCCAGTAAA TGATAATTTT 
GTTATTCGTG AAGATGTCTA TAATGCATTA GGCAAGCCAG ACGTTTCAAC ACCAGAAAAT 
TTTGAAAAAG TCATGCAACA GATTAAAGAA AAATATCCTG AGATGACCCC AATGGGCTTC 
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ACCACAGTGG GCGATGGTGC AGGACCATTT 
CCTTTAGAGG ATAAAAATGG TAAATACTAT 
TGGTTAAAAA CATTTAATGA TGTTTACCGA 
GATGATGGGG CAACGTTTGA TGAAAAAGTG 
GCTGGAACCA GTGGTCAAGG TGGGAACTTC 
TATATAGCCA TTGATGGACC AAGTAQCACT 
GGCATTTCAG GTTGGTTAAG TAATTACATT 
ACTCAACTGT TCACATATTT AATTGATGAA 
GAAGGAGTTA CTTATGCGTA CAATGATCAA 
AAATTAGAAC AAACAGACAA TGATGCCTAC 
TACTTTAACA ACGACCGTGT CAATAAACTA 
ATGCAAGAAT GGGGCAAAGG AAAATTAGTC 
GATGCAGGAA CGCCGGAAGC TCGTGCGAAT 
GTTATTTCAA TGATTCGTGC GAAAGATGAT 
AAAGCATTCT TAAAATCAAA TAAATGGGAT 
GCGGAAAACA GAGACAAACT TAAG 



TTAGACAAAT TACAAGACTT CTTAGGTGTT 
GATCGAAATT TAGATAAAGA ATATTTAGAA 
GCAGGCAATA TTAGTGATGA TAGCTTCACA 
AAACAAGGAA ATTATGCAAC CATGCTCGTT 
ACAGAATTTA TGAAAAAATC TGGCACACGT 
TCTGGCCGAA AACCAACATT AAATCAAACC 
ACGAAAGATG CGAAAGATCC AGCAAAAGTC 
CCGGGACAAA TTTTAACAAA ATATGGCGTT 
GGAAAAATTG ATTATTTACC AGAAGTGAAA 
AACAAAAAAT ATGGCATTAG TCGTTTCCTA 
AAAGTACCAA TGGAAAGTGC TTTAACGCAA 
CCACATTTCG TAATTGAAAA TATTAATCCA 
GAAGCGATTG AAACCAAACT AAATACAACC 
AAAGCCTTTG ACAAATCTTT AGAAGACTAC 
GCAATTGAAA AAATAAAATC TGAGAAAATG 



EF018-4 (SEQ ID NO: 68) 



CSGGE 

KKASSDVSIK DRYELDEKTP AWKLDKKKEP 
VDIEFISGDD SKLNAMISSG DMPDIVTLTE 
VNQDTFKWYA LEDGKTYGYP NYSNTKADYE 
FEKVMQQIKE KYPEMTPMGF TTVGDGAGPF 
WLKTFNDVYR AGNISDDSFT DDGATFDEKV 
YIAIDGPSST SGRKPTLNQT GISGWLSNYI 
EGVTYAYNDQ GKIDYLPEVK KLEQTDNDAY 
MQEWGKGKLV PHFVIENINP DAGTPEARAN 
KAFLKSNKWD AIEKIKSEKM AENRDKLK 



EF019-1 (SEQ ID NO:69) 



TKIKWYINSD WTALPFGKDV TTAQIKKDLN 
KTGQAALKAD SWAYSLNDLA KKYDPYLMKV 
SGNIPVNDNF VIREDVYNAL GKPDVSTPEN 
LDKLQDFLGV PLEDKNGKYY DRNLDKEYLE 
KQGNYATMLV AGTSGQGGNF TEFMKKSGTR 
TKDAKDPAKV TQLFTYLIDE PGQILTKYGV 
NKKYGISRFL YFNNDRVNKL KVPMESALTQ 
EAIETKLNTT VISMIRAKDD KAFDKSLEDY 



TAAAGGAGTT ACACAATGAA 
CTTGGTTCAT TCTTACTCGC 
AAAACACATG AAGTAACAGA 
CGGATTATTG CGAGTTATTT 
CAATGGACAG TTGGACAAGG 
CCCACTATTT CCTATGACTT 
TTAATCAGTT CATCTGCTCT 
CCAACTTATG TAGTCAAAAA 
GCCACTGTTT TAGATAAAAA 
ACCAAAGGCG TCCAAGAATA 
TGGGTAACCA ACAACCAAGT 
TATCAGGACT TAGGCCTCCA 
GCGGATTGGA ATCAAGTTTC 
CTTGTAAACA GCGATGAATC 
GCTGTGAAAA ATAACCAAGT 
CCTATTGCGA ATACTCAAAT 



ACTTTTAAAA AAGACGGTCC 
AGCTTGTGGT AATACGAATA 
TACCTTAGGC AATAAAGTAA 
AGAAGATTAT CTAGTTGCAT 
CAGCATTCAA GATTATTTAG 
GCCATATGAA GCGGTTCTAA 
AGTTGAAGGC GGTAAATACA 
CGGCGAAAAT GTCACCTGGC 
AGAACAAGCG AAAAAAGTGT 
TCTTGGCAAA AAAGATGCTG 
CTTTATGGTT AGCGATAATC 
AGTTCCAAAA TTAGTGGAAG 
TTTAGAAAAA TTAGCTGAGC 
AGCACCTCTT TTCCAAGAAG 
TCATACCTAT GATAAAAAAA 
TGTTGAAGAT GTAAAAAAAG 



TAATTGGTAC AACCCTTCTT 
AAGAAGCCAA CAACGCTGAC 
CCGTCCCCGC GAAACCCAAA 
TAGGAGAAAA ACCAGTGGCA 
CGAAAGAATT GAAAGATGTC 
AATTTGAACC TGACTTATTA 
AAGAATACAG TAAAATTGCG 
GTGATCAATT GGAAGATATT 
TAGAAGATTA TGATACCTTA 
GCAAATCTGC GGCAGTCTTA 
GCTCAAGCGG AACCGTGCTC 
AAATTTCTAA AAACGCTACT 
TTGACGCAGA CCACATTTTC 
CAATTTGGAA GAACTTACCT 
GTAGTTGGTT ATACAACGGA 
CGCTCTTAAA TTAA 



EF019-2 ((SEQ ID NO:70) 



MKLLKK TVLIGTTLLL GSFLLAACGN TNKEANNADK THEVTDTLGN KVTVPAKPKR 
IIASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAKELKDVP TISYDLPYEA VLKFEPDLLL 
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ISSSALVEGG KYKEYSKIAP TYWKNGENV TWRDQLEDIA TVLDKKEQAK KVLEDYDTLT 
KGVQEYLGKK DAGKSAAVLW VTNNQVFMVS DNRSSGTVLY QDLGLQVPKL VEEISKNATA 
DWNQVSLEKL AELDADHIFL VNSDESAPLF QEAIWKNLPA VKNNQVHTYD KKSSWLYNGP 
lANTQIVEDV KKALLN 

EF019-3 (SEQ ID N0:71) 

TTGTGGT AATACGAATA AAGAAGCCAA CAACGCTGAC 

AAAACACATG AAGTAACAGA TACCTTAGGC AATAAAGTAA CCGTCCCCGC GAAACCCAAA 
CGGATTATTG CGAGTTATTT AGAAGATTAT CTAGTTGCAT TAGGAGAAAA ACCAGTGGCA 
CAATGGACAG TTGGACAAGG CAGCATTCAA GATTATTTAG CGAAAGAATT GAAAGATGTC 
CCCACTATTT CCTATGACTT GCCATATGAA GCGGTTCTAA AATTTGAACC TGACTTATTA 
TTAATCAGTT CATCTGCTCT AGTTGAAGGC GGTAAATACA AAGAATACAG TAAAATTGCG 
CCAACTTATG TAGTCAAAAA CGGCGAAAAT GTCACCTGGC GTGATCAATT GGAAGATATT 
GCCACTGTTT TAGATAAAAA AGAACAAGCG AAAAAAGTGT TAGAAGATTA TGATACCTTA 
ACCAAAGGCG TCCAAGAATA TCTTGGCAAA AAAGATGCTG GCAAATCTGC GGCAGTCTTA 
TGGGTAACCA ACAACCAAGT CTTTATGGTT AGCGATAATC GCTCAAGCGG AACCGTGCTC 
TATCAGGACT TAGGCCTCCA AGTTCCAAAA TTAGTGGAAG AAATTTCTAA AAACGCTACT 
GCGGATTGGA ATCAAGTTTC TTTAGAAAAA TTAGCTGAGC TTGACGCAGA CCACATTTTC 
CTTGTAAACA GCGATGAATC AGCACCTCTT TTCCAAGAAG CAATTTGGAA GAACTTACCT 
GCTGTGAAAA ATAACCAAGT TCATACCTAT GATAAAAAAA GTAGTTGGTT ATACAACGGA 
CCTATTGCGA ATACTCAAAT TGTTGAAGAT GTAAAAAAAG CGCTCTTAAA T 



EF019-4 (SEQ ID NO:72) 

CGN TNKEANNADK THEVTDTLGN KVTVPAKPKR 

IIASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAKELKDVP TISYDLPYEA VLKFEPDLLL 
ISSSALVEGG KYKEYSKIAP TYWKNGENV TWRDQLEDIA TVLDKKEQAK KVLEDYDTLT 
KGVQEYLGKK DAGKSAAVLW VTNNQVFMVS DNRSSGTVLY QDLGLQVPKL VEEISKNATA 
DWNQVSLEKL AELDADHIFL VNSDESAPLF QEAIWKNLPA VKNNQVHTYD KKSSWLYNGP 
lANTQIVEDV KKALLN 



EF020-1 (SEQ ID NO:73) 

TGAGGAGATG AGAAAATGAA AAAGGTAGTT TCAATTTTGT TGATGGTTGT TGCAGTCTTC 
ACATTAACTG CATGTAATGG TTCTAAATTA GATAAAACAG GTGAAGAATT TAAAAATTCT 
ATAATGAAAG ATTCTTCATA TGGTGATGAA TATTCAGAAG ATGGTTTTAG TTTTTTAATA 
TATAAAGATA AAGACACTAA TCGTTATTTG GCTGATGTTT GGGTTCCTGT TAAAGATGAA 
ACTAGCGCAT TGGAGTATTT TTATTATTAT GATGAAGATA AGCGATTAGA TAGTACTAAA 
AGTAAAGTAA CCTTTGATGA TATGAAAGCT AGTGGAAACT ATGAAGTAGT GTATAAATCA 
GGGAAATTTA AATAA 

EF020-2 (SEQ ID NO: 74) 

MKKWS ILLMWAVFT LTACNGSKLD KTGEEFKNSI MKDSSYGDEY SEDGFSFLIY 
KDKDTNRYLA DVWVPVKDET SALEYFYYYD EDKRLDSTKS KVTFDDMKAS GNYEWYKSG 
KFK 



EF020-3 (SEQ ID NO: 75) 

ATGTAATGG TTCTAAATTA GATAAAACAG GTGAAGAATT TAAAAATTCT 
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ATAATGAAAG ATTCTTCATA TGGTGATGAA TATTCAGAAG ATGGTTTTAG TTTTTTAATA 
TATAAAGATA AAGACACTAA TCGTTATTTG GCTGATGTTT GGGTTCCTGT TAAAGATGAA 
ACTAGCGCAT TGGAGTATTT TTATTATTAT GATGAAGATA AGCGATTAGA TAGTACTAAA 
AGTAAAGTAA CCTTTGATGA TATGAAAGCT AGTGGAAACT ATGAAGTAGT GTATAAATCA 
GGGAAATTTA AA 

EF020-4 (SEQ ID NO:76) 

CNGSKLD KTGEEFKNSI MKDSSYGDEY SEDGFSFLIY 

KDKDTNRYLA DVWVPVKDET SALEYFYYYD EDKRLDSTKS KVTFDDMKAS GNYEWYKSG 
KFK 



EF021-1 (SEQ ID NO:77) 

TAGTTGTTTA AATACATTAA ACTATTTTTA GGAGGCTTTA CAGAAATGAA AAAAGCAAAA 
TTATTCGGTT TTAGTTTGAT TGCATTAGGT TTATCAGTTT CACTTGCAGC ATGTGGTGGT 
GGCAAAGGCA AAACCGCTGA AAGCGGCGGT GGCAAAGGGG ATGCAGCGCA TAGTGCTGTA 
ATCATTACAG ATACAGGCGG CGTGGATGAC AAGTCGTTCA ACCAATCTTC TTGGGAAGGA 
TTGCAAGCTT GGGGTAAAGA ACATGATTTA CCAGAAGGTT CAAAAGGGTA TGCATATATT 
CAATCGAATG ATGCAGCTGA CTATACAACC, AATATT6ACC AAGCGGTATC AAGTAAATTC 
AACACAATCT TTGGTATTGG CTACTTGCTA AAAGATGCAA TTTCTTCTGC AGCAGATGCC 
AACCCTGATA CAAACTTTGT TTTAATCGAT GATCAAATCG ATCGCAAAAA GAATGTCGTT 
TCTGCAACAT TTAGAGATAA TGAAGCAGCT TACTTAGCCG GTGTTGCTGC TGCAAATGAA 
ACAAAAACGA ACAAAGTCGG TTTTGTTGGT GGTGAAGAAG GGGTCGTAAT TGACCGTTTC 
CAAGCTGGTT TTGAAAAAGG TGTGGCTGAT GCTGCGAAAG AATTAGGTAA AGAAATTACT 
GTTGATACGA AATATGCGGC TTCATTTGCT GATCCTGCCA AAGGGAAAGC TTTAGCTGCT 
GCAATGTACC AAAACGGCGT TGATATCATC TTCCATGCTT CTGGTGCGAC TGGACAAGGG 
GTCTTCCAAG AAGCAAAAGA CTTGAATGAA TCAGGTTCTG GCGACAAAGT TTGGGTAATC 
GGCGTTGACC GCGATCAAGA TGCTGATGGC AAGTACAAAA CAAAAGACGG CAAAGAAGAC 
AACTTCACGT TAACTTCAAC GCTTAAAGGT GTCGGCACAG CGGTTCAAGA TATTGCCAAC 
CGTGCGTTAG AAGACAAATT CCCTGGTGGC GAACATTTAG TTTATGGATT AAAAGATGGT 
GGCGTTGACT TAACAGACGG CTATTTAAAC GACAAAACAA AAGAAGCTGT TAAAACAGCA 
AAAGATAAAG TAATCTCAGG TGACGTAAAA GTCCCAGAAA AACCAGAATA A 



EF021-2 (SEQ ID NO:78) 

MKKAKL FGFSLIALGL SVSLAACGGG KGKTAESGGG KGDAAHSAVI 

ITDTGGVDDK SFNQSSWEGL QAWGKEHDLP EGSKGYAYIQ SNDAADYTTN IDQAVSSKFN 
TIFGIGYLLK DAISSAADAN PDTNFVLIDD QIDGKKNWS ATFRDNEAAY LAGVAAANET 
KTNKVGFVGG EEGWIDRFQ AGFEKGVADA AKELGKEITV DTKYAASFAD PAKGKALAAA 
MYQNGVDIIF HASGATGQGV FQEAKDLNES GSGDKVWVIG VDRDQDADGK YKTKDGKEDN 
FTLTSTLKGV GTAVQDIANR ALEDKFPGGE HLVYGLKDGG VDLTDGYLND KTKEAVKTAK 
DKVISGDVKV PEKPE 



EF021-3 (SEQ ID NO:79) 
ATGTGGTGGT 

GGCAAAGGCA AAACCGCTGA AAGCGGCGGT 
ATCATTACAG ATACAGGCGG CGTGGATGAC 
TTGCAAGCTT GGGGTAAAGA ACATGATTTA 
CAATCGAATG ATGCAGCTGA CTATACAACC 



GGCAAAGGGG ATGCAGCGCA TAGTGCTGTA 
AAGTCGTTCA ACCAATCTTC TTGGGAAGGA 
CCAGAAGGTT CAAAAGGGTA TGCATATATT 
AATATTGACC AAGCGGTATC AAGTAAATTC 
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AACACAATCT TTGGTATTGG CTACTTGCTA 
AACCCTGATA CAAACTTTGT TTTAATCGAT 
TCTGCAACAT TTAGAGATAA OXSAAGCAGCT 
ACAAAAACGA ACAAAGTCGG TTTTGTTGGT 
CAAGCTGGTT TTGAAAAAGG TGTGGCTGAT 
GTTGATACGA AATATGGGGC TTCATTTGCT 
GCAATGTACC AAAACGGCGT TGATATCATC 
GTCTTCCAAG AAGCAAAAGA CTTGAATGAA 
GGCGTTGACC GCGATCAAGA TGCTGATGGC 
AACTTCACGT TAACTTCAAC GCTTAAAGGT 
CGTGCGTTAG AAGACAAATT CCCTGGTGGC 
GGCGTTGACT TAACAGACGG CTATTTAAAC 
AAAGATAAAG TAATCTCAGG TGACGTAAAA 



AAAGATGCAA TTTCTTCTGC AGCAGATGCC 
GATCAAATCG ATGGCAAAAA OAATGTCGTT 
TACTTAGCCG GTGTTGCTGC TGCAAATGAA 
GGTGAAGAAG GGGTCGTAAT TGACCGTTTC 
GCTGCGAAAG AATTAGGTAA AGAAATTACT 
GATCCTGCCA AAGGGAAAGC TTTAGCTGCT 
TTCCATGCTT CTGGTGCGAC TGGACAAGGG 
TCAGGTTCTG GCGACAAAGT TTGGGTAATC 
AAGTACAAAA CAAAAGACGG CAAAGAAGAC 
GTCGGCACAG CGGTTCAAGA TATTGCCAAC 
GAACATTTAG TTTATGGATT AAAAGATGGT 
GACAAAACAA AAGAAGCTGT TAAAACAGCA 
GTCCCAGAAA AACCAGAA 



EF021-4 (SEQ ID NO: 80) 



CGGG KGKTAESGGG KGDAAHSAVI 
ITDTGGVDDK SFNQSSWEGL QAWGKEHDLP 
TIFGIGYLLK DAISSAADAN PDTNFVLIDD 
KTNKVGFVGG EEGWIDRFQ AGFEKGVADA 
MYQNGVDIIF HASGATGQGV FQEAKDLNES 
FTLTSTLKGV GTAVQDIANR ALEDKFPGGE 
DKVISGDVKV PEKPE 



EGSKGYAYIQ SNDAADYTTN IDQAVSSKFN 
QIDGKKNWS ATFRDNEAAY LAGVAAANET 
AKELGKEITV DTKYAASFAD PAKGKALAAA 
GSGDKVWVIG VDRDQDADGK YKTKDGKEDN 
HLVYGLKDGG VDLTDGYLND KTKEAVKTAK 



EF022-1 (SEQ ID N0:81) 

TAAGAGCATA AAAAAATGAA GAGTTATAGG 
ACAATGGTTT GTATTTTATT GGTAGGATTT 
AAAAAGAAAC AGAAAAATAC CAAAGAAGCC 
ACGCTCAACA CCTCTGTATT ATTGGATTTT 
GAAGGGTTAT ATAGTTTAGA TGAACAAGAC 
CCGATGATTT CAGAAGATGG AAAAACCTAC 
AGTAACGATG ATCCTGTCAC AGCACATGAT 
CCTAAAAACG GCTTTGTTTA TAGCTTCCTC 
ATCTCAGCGG GGAAATTAGC ACCCAATGAA 
TTAAAGGTGA CGCTCAAAGA GCCAAAACCG 
TTTTTCCCGC AAAATCNAAA AGTAGTCGAA 
GATAAAGTCG TCTATAATGG TCCGTTCGTG 
TGGCAACTAG CAAAAAATAA TCGCTATTGG 
AATTATACAG TTATCAAAGA AACATCTACC 
GATGTGGCTA CACTAAGTGG TGAACTGGCG 
TCGTATCCAA CAGCGACAAT GAACTATTTG 
ACGCCGCTTG CAAACGAAAA CCTGCGTAAA 
CTAGTCAATA ATATTATTGC AGATGGTTCT 
TTTGTGGCGA ATCCCACAAC GGGTCTCGAT 
TATAACAAAG AAAAAGCGCA AAGTTATTGG 
GTTAACGTTG AATTGATGGT AACAGATGAT 
CAAGGCTCGC TACAAGAATT GTTTCCTGGT 
GAAGCTGCAT TGAACTTTGG GCGAGAAAGT 
CCAGACTATC AAGACCCTAT TTCTACCCTG 
TATCAGAACC CTGTCTATGA CAAATTATTA 
CCAGAAAAAA GATGGGCGAC ACTGATTGCA 



AGAAAGAAGA TGAAAAAGTA TTTAAAAATC 
TTAGCTGGGT GTACCAATAA AAATGAAAAT 
GTTCAACTGA TGTCACCCTC GGAATTAACA 
CCAGATGCTA TTGTCCAAAC TGCAGCGTTT 
CAATTGGTAC CAGCCGTAGC AAAAGCATTG 
ACGATTTCTT TGAGAAAAGA AGCGGTTTGG 
TTTGAATATG CTTGGAAAAA AATGATTGAT 
ATCGTTGAAA CAATTCAAAA OXSGTGCAGAA 
CTAGGTGTCA CAGCTGTGGA TGATTATACA 
TACTTTACGT CCTTGTTAGC TTTTCCGACA 
CAATTTGGTG CGGACTATGG AACTGCTAGT 
GTAAAAGATT GGCAGCAAAC AAAGATGGAC 
GATCACCAGA ACGTGCGCTC AGACATTATC 
GCATTGAATC TTTTTGAAGA TGGACAATTA 
CAACAGAATA AAAATAATAC GTTGTATCAT 
CGCTTAAATC AAAAACGGNA AGGGCAAGCN 
GCATTGGCTT TAGGAATAGA TAAAGAAAAT 
AAAGCGCTAC ATGGTGCGAT TACGGAAGGC 
TTTCGTCAAG AAGCAGGTAA TTTAATGGTT 
AAAAAAGCAC AAGCAGAATT AGGAGAAAAG 
GGTTCTTACA AAAAAATTGG TGAAAGTTTG 
TTGACAATAG AGCTAACCGC ATTGCCGACT 
GACTATGATT TATTCTTAAT TTACTGGACA 
ATGACTTTAT ACAAGGGCAA TGATCGCAAT 
GATGAAGCAG CCACAACCTA TGCCTTAGAG 
GCTGAAAAAG AAGTGATTGA AACGACTGCT 
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GGCATGATTC CACTl'AGCCA AAATGAACAA ACAGTCCTGC AAAATGATAA AGTCAAAGGC 
TTGAATTTTC ATACCTTTGG CGCTCCATTA ACGTTAAAAA ATGTTTATAA GGAAAAATAA 



EF022-2 (SEQ ID NO: 82) 

MKKYLKIT MVCILLVGFL AGCTNKNENK KKQKNTKEAV QLMSPSELTT 
LNTSVLLDFP DAIVQTAAFE GLYSLDEQDQ LVPAVAKALP MISEDGKTYT ISLRKEAVWS 
NDDPVTAHDF EYAWKKMIDP KNGFVYSFLI VETIQNGAEI SAGKLAPNEL GVTAVDDYTL 
KVTLKEPKPY FTSLLAFPTF FPQNXKWEQ FGADYGTASD KWYNGPFW KDWQQTKMDW 
QLAKNNRYWD HQNVRSDIIN YTVIKETSTA LNLFEDGQLD VATLSGELAQ QNKNNTLYHS 
YPTATMNYLR LNQKRXGQAT PLANENLRKA LALGIDKENL VNNIIADGSK ALHGAITEGF 
VANPTTGLDF RQEAGNLMVY NKEKAQSYWK KAQAELGEKV NVELMVTDDG SYKKIGESLQ 
GSLQELFPGL TIELTALPTE AALNFGRESD YDLFLIYWTP DYQDPISTLM TLYKGNDRNY 
QNPVYDKLLD EAATTYALEP EKRWATLIAA EKEVIETTAG MIPLSQNEQT VLQNDKVKGL 
NFHTFGAPLT LKNVYKEK 



EF022-3 (SEQ ID NO:83) 

GT GTACCAATAA AAATGAAAAT 
AAAAAGAAAC AGAAAAATAC CAAAGAAGCC 
ACGCTCAACA CCTCTGTATT ATTGGATTTT 
GAAGGGTTAT ATAGTTTAGA TGAACAAGAC 
CCGATGATTT CAGAAGATGG AAAAACCTAC 
AGTAACGATG ATCCTGTCAC AGCACATGAT 
CCTAAAAACG GCTTTGTTTA TAGCTTCCTC 
ATCTCAGCGG GGAAATTAGC ACCCAATGAA 
TTAAAGGTGA CGCTCAAAGA GCCAAAACCG 
TTTTTCCCGC AAAATCNAAA AGTAGTCGAA 
GATAAAGTCG TCTATAATGG TCCGTTCGTG 
TGGCAACTAG CAAAAAATAA TCGCTATTGG 
AATTATACAG TTATCAAAGA AACATCTACC 
GATGTGGCTA CACTAAGTGG TGAACTGGCG 
TCGTATCCAA CAGCGACAAT GAACTATTTG 
ACGCCGCTTG CAAACGAAAA CCTGCGTAAA 
CTAGTCAATA ATATTATTGC AGATGGTTCT 
TTTGTGGCGA ATCCCACAAC GGGTCTCGAT 
TATAACAAAG AAAAAGCGCA AAGTTATTGG 
GTTAACGTTG AATTGATGGT AACAGATGAT 
CAAGGCTCGC TACAAGAATT GTTTCCTGGT 
GAAGCTGCAT TGAACTTTGG GCGAGAAAGT 
CCAGACTATC AAGACCCTAT TTCTACCCTG 
TATCAGAACC CTGTCTATGA CAAATTATTA 
CCAGAAAAAA GATGGGCGAC ACTGATTGCA 
GGCATGATTC CACTTAGCCA AAATGAACAA 
TTGAATTTTC ATACCTTTGG CGCTCCATTA 



GTTCAACTGA TGTCACCCTC GGAATTAACA 
CCAGATGCTA TTGTCCAAAC TGCAGCGTTT 
CAATTGGTAC CAGCCGTAGC AAAAGCATTG 
ACGATTTCTT TGAGAAAAGA AGCGGTTTGG 
TTTGAATATG CTTGGAAAAA AATGATTGAT 
ATCGTTGAAA CAATTCAAAA TGGTGCAGAA 
CTAGGTGTCA CAGCTGTGGA TGATTATACA 
TACTTTACGT CCTTGTTAGC TTTTCCGACA 
CAATTTGGTG CGGACTATGG AACTGCTAGT 
GTAAAAGATT GGCAGCAAAC AAAGATGGAC 
GATCACCAGA ACGTGCGCTC AGACATTATC 
GCATTGAATC TTTTTGAAGA TGGACAATTA 
CAACAGAATA AAAATAATAC GTTGTATCAT 
CGCTTAAATC AAAAACGGNA AGGGCAAGCN 
GCATTGGCTT TAGGAATAGA TAAAGAAAAT 
AAAGCGCTAC ATGGTGCGAT TACGGAAGGC 
TTTCGTCAAG AAGCAGGTAA TTTAATGGTT 
AAAAAAGCAC AAGCAGAATT AGGAGAAAAG 
GGTTCTTACA AAAAAATTGG TGAAAGTTTG 
TTGACAATAG AGCTAACCGC ATTGCCGACT 
GACTATGATT TATTCTTAAT TTACTGGACA 
ATGACTTTAT ACAAGGGCAA TGATCGCAAT 
GATGAAGCAG CCACAACCTA TGCCTTAGAG 
GCTGAAAAAG AAGTGATTGA AACGACTGCT 
ACAGTCCTGC AAAATGATAA AGTCAAAGGC 
ACGTTAAAAA ATGTTTATAA GGAAAAA 



EF022-4 (SEQ ID NO: 84) 

CTNKNENK KKQKNTKEAV QLMSPSELTT 
LNTSVLLDFP DAIVQTAAFE GLYSLDEQDQ 
NDDPVTAHDF EYAWKKMIDP KNGFVYSFLI 



LVPAVAKALP MISEDGKTYT ISLRKEAVWS 
VETIQNGAEI SAGKLAPNEL GVTAVDDYTL 
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KVTLKEPKPY FTSLLAFPTF FPQNXKWEQ 
QLAKNNRYWD HQNVRSDIIN YTVIKETSTA 
YPTATMNYLR LNQKRXGQAT PLANENLRKA 
VANPTTGLDF RQEAGNLMVY NKEKAQSYWK 
GSLQELFPGL TIELTALPTE AALNFGRESD 
QNPVYDKLLD EAATTYALEP EKRWATLIAA 
NFHTFGAPLT LKNVYKEK 

EF023-1 (SEQ ID NO: 85) 



FGADYGTASD KWYNGPFW KDWQQTKMDW 
LNLFEDGQLD VATLSGELAQ QNKNNTLYHS 
LALGIDKENL VNNIIADGSK ALHGAITEGF 
KAQAELGEKV NVELMVTDDG SYKKIGESLQ 
YDLFLIYWTP DYQDPISTLM TLYKGNDRNY 
EKEVIETTAG MIPLSQNEQT VLQNDKVKGL 



TAAAATGGAG GGATCGGTAT GAAGAAATTA AAAATGTTAG GATGCGTCGG GTTGCTTTTA 
GCTTTAACGG CTTGTCAGGC GGGAACGGGA AACTCGGCTG ATAGTAACAA AGCAGCGGAA 
CAAAAAATTG CAATTAGTTC TGAAGCGGCT ATTTCGACAA TGGAACCACA CACAGCGGGG 
GATACGACCT CGACTTTAGT CATGAATCAA GTTTATGAAG GACTCTATGT TTTAGGTAAA 
GAAGATGAAT TAGAGTTGGG GGTCGCTGCC GAAGAACCAG CGATTTCTGA AGATGAAACC 
GTTTATACAT TTAAGATTAG AGAAGATGCC AAATGGTCGA ATGATGATCC AGTAACAGCA 
AACGACTTTG TTTATGCATG GCAACAAGTT GCTTCCCCTA AATCAGGATC GATTCATCAA 
GCTTTATTTT TTGATGTCAT TAAAAATGCT AAGGAAATTG CTTTAGAAGG CGCAGATGTG 
AATACTCTTG GGGTTAAGGC GCTAGATGAT AAAACGTTAG AAATAACTTT AGAACGGCCC 
ACCCCTTATT TGAAATCATT ACTTTCGTTT CCTGTTTTGT TTCCACAAAA TGAAAAATAT 
ATCAAAGAAC AAGGGGATAA ATATGCTACT GATGCAGAAC ATTTGATTTA TAATGGTCCT 
TTTAAATTGA AAGAATGGGA TAATGCCTCT TCTGATGACT GGACCTACGA AAAAAATGAT 
ACGTATTGGG ATGCTGAAAA AGTTAAATTA ACAGAAGCGA AAGTTTCAGT AATTAAGAGC 
CCAACGACAG CGGTGAATTT GTTTGACTCG AATGAATTGG ATGTAGTGAA TAAGCTAAGT 
GGTGAATTTA TTCCTGGTTA TGTTGATAAT CCAGCCTTTC TTTCAATTCC TCAATTCGTC 
ACATACTTTT TAAAAATGAA CAGCGTTCGT GATGGAAAAG AAAATCCGGC TTTAGCGAAC 
AACAATATTC GTAAAGCGTT GGCACAAGCT TTTGATAAAG AAAGTTTTGT AAAAGAAGTC 
TTGCAAGATC AATCAACGGC TACAGATCAA GTAATTCCGC CGGGACAAAC GATTGCGCCA 
GATGGAACAG ATTTCACAAA ACTAGCTGCT AAGAAAAATA ACTACTTAAC CTACGATACA 
GCGAAAGCAA AAGAATTCTG GGAAAAAGGG AAAAAAGAAA TTGGGCTGGA TAAAATCAAA 
TTAGAATTTT TAACAGATGA TACAGACAGC GCCAAAAAAG CTGCTGAGTT TTTCCAATTT 
CAATTGGAAG AAAATCTAGA TGGATTAGAA GTGAATGTTA CTCAAGTTCC TTTTACTATT 
CGTGTTGATC GTGATCAAAC GAGAGACTAT GATTTAGAAT TATCTGGTTG GGGAACCGAT 
TATCGTGATC CATTAACAGT TATGCGCATC TTTACTTCGG ATAGTACCTT GGGCGGCGTA 
ACGTTCAAGA GTGATACGTA TGATCAATTA ATTCAAGAAA CTAGAACAAC ACATGCGGCT 
GATCAAGAGG CTCGTTTAAA TGACTTTGCT CAAGCACAAG ATATTTTGGT GAATCAGGAA 
ACGGTTTTAG CACCAATCTA CAATCGAAGC ATTTCTGTAT TAGCTAATCA AAAAATCAAG 
GATCTGTATT GGCATTCATT TGGACCCACG TACAGTTTAA AATGGGCTTA TGTTAACTAA 



EF023-2 (SEQ ID NO: 86) 

MKKLK MLGCVGLLLA LTACQAGTGN SADSNKAAEQ KIAISSEAAI STMEPHTAGD 
TTSTLVMNQV YEGLYVLGKE DELELGVAAE EPAISEDETV YTFKIREDAK WSNDDPVTAN 
DFVYAWQQVA SPKSGSIHQA LFFDVIKNAK EIALEGADVN TLGVKALDDK TLEITLERPT 
PYLKSLLSFP VLFPQNEKYI KEQGDKYATD AEHLIYNGPF KLKEWDNASS DDWTYEKNDT 
YWDAEKVKLT EAKVSVIKSP TTAVNLFDSN ELDWNKLSG EFIPGYVDNP AFLSIPQFVT 
YFLKMNSVRD GKENPALANN NIRKALAQAF DKESFVKEVL QDQSTATDQV IPPGQTIAPD 
GTDFTKLAAK KNNYLTYDTA KAKEFWEKGK KEIGLDKIKL EFLTDDTDSA KKAAEFFQFQ 
LEENLDGLEV NVTQVPFTIR VDRDQTRDYD LELSGWGTDY RDPLTVMRIF TSDSTLGGVT 
FKSDTYDQLI QETRTTHAAD QEARLNDFAQ AQDILVNQET VLAPIYNRSI SVLANQKIKD 
LYWHSFGPTY SLKWAYVN 
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EF023-3 {SEQ ID NO: 87) 

GGGAACGGGA AACTCGGCTG ATAGTAACAA AGCAGCGGAA 

CAAAAAATTG CAATTAGTrC TGAAGCGGCT ATTTCGACAA TGGAACCACA CACAGCGGGG 
GATACGACCT CGACTTTAGT CATGAATCAA GTTTATGAAG GACTCTATGT TTTAGGTAAA 
GAAGATGAAT TAGAGTTGGG GGTCGCTGCC GAAGAACCAG CGATTTCTGA AGATGAAACC 
GTTTATACAT TTAAGATTAG AGAAGATGCC AAATGGTCGA ATGATGATCC AGTAACAGCA 
AACGACTTTG TTTATGCATG GCAACAAGTT GCTTCCCCTA AATCAGGATC GATTCATCAA 
GCTTTATTTT TTGATGTCAT TAAAAATGCT AAGGAAATTG CTTTAGAAGG GGCAGATGTG 
AATACTCTTG GGGTTAAGGC GCTAGATGAT AAAACGTTAG AAATAACTTT AGAACGGCCC 
ACCCCTTATT TGAAATCATT ACTTTCGTTT CCTGTTTTGT TTCCACAAAA TGAAAAATAT 
ATCAAAGAAC AAGGGGATAA ATATGCTACT GATGCAGAAC ATTTGATTTA TAATGGTCCT 
TTTAAATTGA AAGAATGGGA TAATGCCTCT TCTGATGACT GGACCTACGA AAAAAATGAT 
ACGTATTGGG ATGCTGAAAA AGTTAAATTA ACAGAAGCGA AAGTTTCAGT AATTAAGAGC 
CCAACGACAG CGGTGAATTT GTTTGACTCG AATGAATTGG ATGTAGTGAA TAAGCTAAGT 
GGTGAATTTA TTCCTGGTTA TGTTGATAAT CCAGCCTTTC TTTCAATTCC TCAATTCGTC 
ACATACTTTT TAAAAATGAA CAGCGTTCGT GATGGAAAAG AAAATCCGGC TTTAGCGAAC 
AACAATATTC GTAAAGCGTT GGCACAAGCT TTTGATAAAG AAAGTTTTGT AAAAGAAGTC 
TTGCAAGATC AATCAACGGC TACAGATCAA GTAATTCCGC CGGGACAAAC GATTGCGCCA 
GATGGAACAG ATTTCACAAA ACTAGCTGCT AAGAAAAATA ACTACTTAAC CTACGATACA 
GCGAAAGCAA AAGAATTCTG GGAAAAAGGG AAAAAAGAAA TTGGGCTGGA TAAAATCAAA 
TTAGAATTTT TAACAGATGA TACAGACAGC GCCAAAAAAG CTGCTGAGTT TTTCCAATTT 
CAATTGGAAG AAAATCTAGA TGGATTAGAA' GTGAATGTTA CTCAAGTTCC TTTTACTATT 
CGTGTTGATC GTGATCAAAC GAGAGACTAT GATTTAGAAT TATCTGGTTG GGGAACCGAT 
TATCGTGATC CATTAACAGT TATGCGCATC TTTACTTCGG ATAGTACCTT GGGCGGCGTA 
ACGTTCAAGA GTGATACGTA TGATCAATTA ATTCAAGAAA CTAGAACAAC ACATGCGGCT 
GATCAAGAGG CTCGTTTAAA TGACTTTGCT CAAGCACAAG ATATTTTGGT GAATCAGGAA 
ACGGTTTTAG CACCAATCTA CAATCGAAGC ATTTCTGTAT TAGCTAATCA AAAAATCAAG 
GATCTGTATT GGCATTCATT TGGACCCACG TACAGTTTAA AATGGGCTTA TGTTAAC 



EF023-4 {SEQ ID NO:88) 



GTGN SADSNKAAEQ KIAISSEAAI STMEPl 
TTSTLVMNQV YEGLYVLGKE DELELGVAAE 
DFVYAWQQVA SPKSGSIHQA LFFDVIKNAK 
PYLKSLLSFP VLFPQNEKYI KEQGDKYATD 
YWDAEKVKLT EAKVSVIKSP TTAVNLFDSN 
YFLKMNSVRD GKENPALANN NIRKALAQAF 
GTDFTKLAAK KNNYLTYDTA KAKEFWEKGK 
LEENLDGLEV NVTQVPFTIR VDRDQTRDYD 
FKSDTYDQLI QETRTTHAAD QEARLNDFAQ 
LYWHSFGPTY SLKWAYVN 



EPAISEDETV YTFKIREDAK WSNDDPVTAN 
EIALEGADVN TLGVKALDDK TLEITLERPT 
AEHLIYNGPF KLKEWDNASS DDWTYEKNDT 
ELDWNKLSG EFIPGYVDNP AFLSIPQFVT 
DKESFVKEVL QDQSTATDQV IPPGQTIAPD 
KEIGLDKIKL EFLTDDTDSA KKAAEFFQFQ 
LELSGWGTDY RDPLTVMRIF TSDSTLGGVT 
AQDILVNQET VLAPIYNRSI SVLANQKIKD 



EF024-1 {SEQ ID NO: 89) 



TAATGGCCGT TTCGTCTACT AATAAAGAGG 
AACAAGGATC ATAAAAAAGG AGAAGTGAGC 
GTCGGCTTGT TATTGTTGTC AGGTTGTGGA 
GGTGGTAAAT GGAAAGTGGA AGAAACACGT 
TTTTCAGCTA ATGACTCAGA GGATAGTGTT 
AAAAAAATAA CCTTTGACNT TACTAGCAGN 
AANGNTANCA AGATTACAGG GGAAATTGGC 
ACAGAATAA 



ATGAAGCTAC TCAAATGGCG TTGGCAATGG 
ATGAAAAAAG TACTACCTTT TATTGCCTTA 
ACAGATATGA AAAAGATATT GACTGCCGAT 
GCAACTTACA CTTTTTTTGA TGACGGTAAA 
AGTGGGACAT ACACTTATGA TGAAAAAAAT 
AACTCTTTCA TTATGGAAAA AGTNGANTNC 
GAAAAACAAA GAACACTTAT AAAACAAAAA 
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EF024-2 (SEQ ID NO: 90) 

M KKVLPFIALV GLLLLSGCGT DMKKILTADG 

GKWKVEETRA TYTFFDDGKF SANDSEDSVS GTYTYDEKNK KITFDXTSXN SFIMEKVXXX 
XXKITGEIGE KQRTLIKQKT E 



EF024-3 (SEQ ID NO: 91) 
ATT GACTGCCGAT 

GGTGGTAAAT GGAAAGTGGA AGAAACACGT GCAACTTACA CTTTTTTTGA TGACGGTAAA 
TTTTCAGCTA ATGACTCAGA GGATAGTGTT AGTGGGACAT ACACTTATGA TGAAAAAAAT 
AAAAAAATAA CCTTTGACNT TACTAGCAGN AACTCTTTCA TTATGGAAAA AGTNGANTNC 
AANGNTANCA AGATTACAGG GGAAATTGGC GAAAAACAAA GAACACTTAT AAAACAAAAA 
ACAGAA 



EF024-4 (SEQ ID NO:92) 



LTADG 

GKWKVEETRA TYTFFDDGKF SANDSEDSVS GTYTYDEKNK KITFDXTSXN SFIMEKVXXX 
XXKITGEIGE KQRTLIKQKT E 



EF025-1 (SEQ ID NO:93) 



TGAATGAAAC ATATTAAAGG AATGTTGGTT 
GCGCCAGATC AAGAGCCAAC GAAACAAACA 
AAGCAAGTTA CCGTCACCAA TCAAACGACT 
AATGACGAAC TGATTGCTAA TCAATTGACT 
GTTACAGGGG CCACACAAAC GACATTTGGA 
GAAAAAAAGA AAAAAATGTT TTGGTCCAAT 
TATTATAAAA ATGAAGGTGT ATTTACTGGC 
GAACCTGAAA CGCAAAGGAT TCTGAATGTT 
TATGATACAC GCTATTCGGG TGTCAACAAA 
AGCAACACGC GTACAGACGA TACGTTAGTC 
AAACAAATGC GTGACGAAAA TCGTGTTACA 
ACTTCTGCGC GTGAAGGATT AATGCCTTTA 
CCATCGAAAG AAACGTATAT CGGTTACGCA 
CTTCAAGTGA TAACAGAAGA GCAGAAAATA 
GATGAACAGG AAAAAATCAC AGAAACAGCC 
ATTCACCAGG ATACAATAAA CAAACCAACA 



TTTATCGGAT TATTTATTTT GGTTGGTTGT 
ACAAGTGGTC CGCAAGAGAC AAAGCAAGTG 
TCTGCGGTGG AAAAACAAGC GCCGACTAAA 
TTTGATTCTC ATGAATACAC GTACGAAGTG 
ACAACCCCAC CAGCAAAATA TACACCGGAA 
CAACCGCCTT TGGGATTAAT GACGGGTAAC 
GGAAATTACG <3CATTGTAGA GATTATTACG 
GAGTTTACAG AGTTTGCTAG TGATCCTTAT 
CGCCTGTCGG ATTATCCTGA ATTTCAAGCA 
ACCGTTGTTA ATGGTATTAC TTATGTAGAA 
GGTAATTTTT ATACGGTACG CGGTTCATCA 
GCAGCAGAGA TGGACACTTG GCTAAAAGAG 
GAAGATTTAG GCAATGGCCT AATCGCTCGA 
AAACATGTCA GCTATGATGA ATACTTTTCA 
TGCGGCCTTT TTATCGTCAA TCGAAATATT 
ATTCTTTTAT TCATTTTGTA G 



EF025-2 (SEQ ID NO: 94) 

MKHIKGMLVF IGLFILVGCA PDQEPTKQTT 
DELIANQLTF DSHEYTYEW TGATQTTFGT 
YKNEGVFTGG NYGIVEIITE PETQRILNVE 
NTRTDDTLVT WNGITYVEK QMRDENRVTG 
SKETYIGYAE DLGNGLIARL QVITEEQKIK 
HQDTINKPTI LLFIL 



SGPQETKQVK QVTVTNQTTS AVEKQAPTKN 
TPPAKYTPEE KKKKMFWSNQ PPLGl^MTGNY 
FTEFASDPYY DTRYSGVNKR LSDYPEFQAS 
NFYTVRGSST SAREGLMPLA AEMDTWLKEP 
HVSYDEYFSD EQEKITETAC GLFIVNRNII 



EF025-3 (SEQ ID NO:95) 
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AAC GAAACAAACA ACAAGTGGTC CGCAAGAGAC AAAGCAAGTG 

AAGCAAGTTA CCGTCACCAA TCAAACGACT TCTGCGGTGG AAAAACAAGC GCCGACTAAA 
AATGACGAAC TGATTGCTAA TCAATTGACT TTTGATTCTC ATGAATACAC GTACGAAGTG 
GTTACAGGGG CCACACAAAC GACATTTGGA ACAACCCCAC CAGCAAAATA TACACCGGAA 
GAAAAAAAGA AAAAAATGTT TTGGTCCAAT CAACCGCCTT TGGGATTAAT GACGGGTAAC 
TATTATAAAA ATGAAGGTGT ATTTACTGGC GGAAATTACG GCATTGTAGA GATTATTACG 
GAACCTGAAA CGCAAAGGAT TCTGAATGTT GAGTTTACAG. AGTTTGCTAG TGATCCTTAT 
TATGATACAC GCTATTCGGG TGTCAACAAA CGCCTGTCGG ATTATCCTGA ATTTCAAGCA 
AGCAACACGC GTACAGACGA TACGTTAGTC ACCGTTGTTA ATGGTATTAC TTATGTAGAA 
AAACAAATGC GTGACGAAAA TCGTGTTACA GGTAATTTTT ATACGGTACG CGGTTCATCA 
ACTTCTGCGC GTGAAGGATT AATGCCTTTA GCAGCAGAGA TGGACACTTG GCTAAAAGAG 
CCATCGAAAG AAACGTATAT CGGTTACGCA GAAGATTTAG GCAATGGCCT AATCGCTCGA 
CTTCAAGTGA TAACAGAAGA GCAGAAAATA AAACATGTCA GCTATGATGA ATACTTTTCA 
GATGAACAGG AAAAAATCAC AGAAACAGCC TGCGGCCTTT TTATCGTCAA TCGAAATATT 
ATTCACCAGG ATACAATAAA CAAACCAACA ATTCTTTTAT TCATTTTG 

EF025-4 (SEQ ID NO: 96) 



TKQTT SGPQETKQVK QVTVTNQTTS AVEKQAPTKN 

DELIANQLTF DSHEYTYEW TGATQTTFGT TPPAKYTPEE KKKKMFWSNQ PPLGLMTGNY 
YKNEGVFTGG NYGIVEIITE PETQRILNVE FTEFASDPYY DTRYSGVNKR LSDYPEFQAS 
NTRTDDTLVT WNGITYVEK QMRDENRVTG NFYTVRGSST SAREGLMPLA AEMDTWLKEP 
SKETYIGYAE DLGNGLIARL QVITEEQKIK HVSYDEYFSD EQEKITETAC GLFIVNRNII 
HQDTINKPTI LLFIL 



EF026-1 (SEQ ID NO: 97) 

TGAGTGTATG ATTACTCATT TCCCTTTGAA TCAGTTATGA TAAAGGAAGA AATAAATAAA 
TTTTTTGGAG GGATTTTCAT GAAAATGTCT AAAGTACTCA CCACTGTTTT GACGGCAACT 
GCTGCTCTTG TGTTGCTTAG TGCTTGTTCA TCTGATAAAA AAACAGATAG TAGTTCTAGT 
AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG 
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC 
AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT 
TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT 
GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT 
CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC 
AAAGTTAATG GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC TGACTTAGAA 
AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA ATAATTAA 



EF026-2 (SEQ ID NO:98) 



MKMSK VLTTVLTATA ALVLLSACSS DKKTDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN 



EF026-3 (SEQ ID NO: 99) 



AACAGATAG TAGTTCTAGT 
AGCAAAGAAA CAGCTAATTC 
CCTGAAGAGC TCGAAATGGC 
AATGTCACTT TTGATAAAGA 



AAGTACAGAA GTAGTCTCTG 
GTTAAGTGAT AAAGGAAATT 
GGTAACAGTT GCTGGTACTT 



GTGCTTCAAT TAGTGCCAAG 
GGATTGTCGC AGCTACTGAC 
TCCATGATAA GGGGAAAGAT 
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TCCAACGATG TCTATCGTAA ATTAGCACTT 
GCTGAATATG AAATCACGGT TCCTAAGCTA 
CACGGGACTG- TCAAAGGTGA TATTGAGGTG 
AAAGTTAATG GCAATATTAC TTTTGATAAA 
AAAGATGGTG CCACTGTTAC TGGTGAAGTC 



TATTCCCAAG ATGATAATAA AAAAGTAACT 
ATCGTTTCTT CTGAAAATTT CAACATCGTT 
AAAGCAAATG GCTTTACTTT AAATCGTACC 
CAAGAATACA AAGATTCTGC TGACTTAGAA 
ACCGTAGCCA ATAAT 



EF026-4 (SEQ ID NO:100) 
TDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN 



EF027-1 (SEQ ID NO:101) 



TTTGGTATGA AACAGAAAAA GTGGTTAATC 
GCATGTGGAA GTGGCGGTTC GAAAACGACC 
GTCGCATCTG GTGGTGAACT CTCGACATTA 
TCCGATATGA TTGGTCAAGT AGTTGAAGGC 
GAGCTAGCTA TGGCGAAAGC AGAGCCACAA 
AAGTTACGAG AAGCAAAATG GACAAACGGG 
GCGTTTAGAA ACGTGGTCGA TCCAGCATAC 
TTTAAAAATG GGCGTGCGGT GCGGGAAGGA 
GCAATCGATG ACCAGACACT AGAACTAACA 
GTCTTGGTTG GGACACCTTT TATGCCTAAA 
GCCTATGGGA CTTCTGCAGA TAATTTTGTT 
GATGGCAATT CCGAAACTTG GAAATTGAAG 
GTAAAATTGA ATGAAATTGA TGTTCAAGTA 
TTTGATAATG GCGACTTAGA TTACACTGTT 
GAGTCAAAAC AAGCGCATTT TGTACCTAAA 
CGCCGTGAAA TTACCGGCAA CGAACATGTT 
GAAACTTTTG CAAAAGAAAT TTTAGGAGAT 
GCTAATTTTG CAAAAATCCA GATACAGGTG 
TGCCATATAA TATTAAAGAA GCCCAAGCTA 



GGACTTGTTG CACTGGGCTT GGTTTTAGCA 
TCAAACGAAC CAGCTACACA GAAAATTAAC 
GACAGCGCTC ATTATACAGA TGTCTATAGT 
TTGTATCGAC AAGATAAAAA CGGAGATCCT 
GTTAGTGAAG ACGGGTTAGT CTATACATTC 
GATCCAGTTA AAGCAGGGGA TTTTGTAGTT 
GGTTCAAGTA GCAGTAATCA AATGGATATT 
CAAGCCACGA TGGAAGAATT TGGTGTCAAA 
TTGGAAAATC CAATTCCTTA TTTAGCCCAA 
AATGAAGCCT TTGCCAAAGA AAAAGGTACT 
GGCAATGGGC CGTTTGTAAT TTCAGGTTCG 
AAGAATGATC ATTATTGGGA TAAAGAACAC 
GTGAAAGAAA TTGGCACAGG AGCCAATCTT 
TTAGCAGATA CTTATGCACT TCAGTATAAA 
GCCATGGTGG GTTATTTAAG CCCCAATCAT 
CGAAAAGCTT TTTTACAAGC GATTGACAAA 
GGCTCGACAG CTTTAAATGG NTTTGTACCA 
AAGATTTCCG CAAAGAAAAT GGTGATTTAT 
ACTGGAACAA TT 



EF027-2 (SEQ ID NO:102) 



MKQKKWLI GLVALGLVLA ACGSGGSKTT SNEPATQKIN VASGGELSTL DSAHYTDVYS 
SDMIGQWEG LYRQDKNGDP ELAMAKAEPQ VSEDGLVYTF KLREAKWTNG DPVKAGDFW 
AFRNWDPAY GSSSSNQMDI FKNGRAVREG QATMEEFGVK AIDDQTLELT LENPIPYLAQ 
VLVGTPFMPK NEAFAKEKGT AYGTSADNFV GNGPFVISGW DGNSETWKLK KNDHYWDKEH 
VKLNEIDVQV VKEIGTGANL FDNGDLDYTV LADTYALQYK ESKQAHFVPK AMVGYLSPNH 
RREITGNEHV RKAFLQAIDK ETFAKEILGD GSTALNGFVP ANFAKIQIQV KISAKKMVIY 
CHIILKKPKL TGTI 

EF027-3 {SEQ ID NO: 103) 



AACGACC TCAAACGAAC CAGCTACACA GAAAATTAAC 

GTCGCATCTG GTGGTGAACT CTCGACATTA GACAGCGCTC ATTATACAGA TGTCTATAGT 
TCCGATATGA TTGGTCAAGT AGTTGAAGGC TTGTATCGAC AAGATAAAAA CGGAGATCCT 
GAGCTAGCTA TGGCGAAAGC AGAGCCACAA GTTAGTGAAG ACGGGTTAGT CTATACATTC 
AAGTTACGAG AAGCAAAATG GACAAACGGG GATCCAGTTA AAGCAGGGGA TTTTGTAGTT 
GCGTTTAGAA ACGTGGTCGA TCCAGCATAC GGTTCAAGTA GCAGTAATCA AATGGATATT 
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TTTAAAAATG GGCGTGCGGT GCGGGAAGGA CAAGCCACGA TGGAAGAATT TGGTGTCAAA 
GCAATCGATG ACCAGACACT AGAACTAACA TTGGAAAATC CAATTCCTTA TTTAGCCCAA 
GTCTTGGTTG GGACACCTTT TATGCCTAAA AATGAAGCCT TTGCCAAAGA AAAAGGTACT 
GCCTATGGGA CTTCTGCAGA TAATTTTGTT GGCAATGGGC CGTTTGTAAT TTCAGGTTGG 
GATGGCAATT CCGAAACTTG GAAATTGAAG AAGAATGATC ATTATTGGGA TAAAGAACAC 
GTAAAATTGA ATGAAATTGA TGTTCAAGTA GTGAAAGAAA TTGGCACAGG AGCCAATCTT 
TTTGATAATG GCGACTTAGA TTACACTGTT TTAGCAGATA CTTATGCACT TCAGTATAAA 
GAGTCAAAAC AAGCGCATTT TGTACCTAAA GCCATGGTGG GTTATTTAAG CCCCAATCAT 
CGCCGTGAAA TTACCGGCAA CGAACATGTT CGAAAAGCTT TTTTACAAGC GATTGACAAA 
GAAACTTTTG CAAAAGAAAT. TTTAGGAGAT GGCTCGACAG CTTTAAATGG NTTTGTACCA 
GCTAATTTTG CAAAAATCCA GATACAGGTG AAGATTTCCG CAAAGAAAAT GGTGATTTAT 
TGCCATATAA TATTAAAGAA GCCCAAGCTA A 

EF027-4 (SEQ ID NO: 104) 

TT SNEPATQKIN VASGGELSTL DSAHYTDVYS 

SDMIGQWEG LYRQDKNGDP ELAMAKAEPQ VSEDGLVYTF KLREAKWTNG DPVKAGDFW 
AFRNWDPAY GSSSSNQMDI FKNGRAVREG QATMEEFGVK AIDDQTLELT LENPIPYLAQ 
VLVGTPFMPK NEAFAKEKGT AYGTSADNFV GNGPFVISGW DGNSETWKLK KNDHYWDKEH 
VKLNEIDVQV VKEIGTGANL FDNGDLDYTV LADTYALQYK ESKQAHFVPK AMVGYLSPNH 
RREITGNEHV RKAFLQAIDK ETFAKEILGD GSTALNGFVP ANFAKIQIQV KISAKKMVIY 
CHIILKKPKL 



EF028-1, (SEQ ID NO: 105) 

TAACAGAAGC AATACAACAA CTTAACACTT TGTTTACTTG TTATTTATCA GAAATCAACT 
AAGACTTGTT ATAGTCAATG TATGGGTAGA TATGAAGGAG GAAACAAGGA AATGAAGAAA 
AGAGCTTTGC TAGGGGTTAC CTTATTAACA TTCACAACAT TAGCGGGTTG TACAAATTTA 
TCTGAACAGA AAAGCGGCGA AAAACAAACA GAGGTTGCTG AAGCGAAGGC AACTGAATCT 
GAAAAAGCAT CAGTAAAAAA TGTTATTTTT ATGATTGGAG ATGGCATGGG GAATCCGTAT 
ACAACGGGCT ATCGCTATTT CAAAGCCAAT CACTCAGACA AGCGTGTTCC CCAAACAGCT 
TTTGATACCT ATTTGGTCGG ACAGCAAGCC ACTTATCCAG AAGATGAAGA AGAGAATGTC 
ACCGATTCAG CTTCCGCAGC GACAGCGATG GCTGCCGGAG TGAAAACCTA TAATAATGCT 
ATTGCACTCG ATAATGACAA GTCCAAAACA GAAACAGTGC TCGAACGTGC GAAAAAAGTG 
GGGAAATCAA CGGGTCTTGT AGCAACATCT GAAATAACAC ATGCAACCCC TGCTGCATAT 
GGCGCACATA ATGTTTCACG CAAAAATATG GCAGAAATCG CCGATGACTA TTTTGATGAT 
CAAATCGACG GACAACACAA AGTCGATGTG TTACTTGGCG GCGGCTCGGA ATTATTTGCC 
CGGAAAGATC GTGATTTAGT CAAAGAATTT TCCCAAGCGG GTTATGGTCA TGTCACAGAC 
AAAAAGTCGT TAAATGAGAA CCAAGACGAC AAAATTTTAG GCTTGTTTGC ACCAGGCGGG 
CTACCTAAAA TGATTGACCG AACGGAAGAA GTCCCTTCAT TAGCTGATAT GACAGAAGCG 
GCTCTTCAAC GGTTAGATAA AAATGAAAAA GGTTTCTTTT TAATGGTTGA AGGTAGTCAA 
ATTGATTGGG CCGGGCATAG CAATGATATT GTTGGCGCGA TGAGCGAAAT GCAAGACTTC 
GAAGCGGCGT TTGAAAAGGC CATCGATTTT GCCAAAAAAG ATGGTGAACA TTGGTGGTTA 
CAACTGCAGA TCATTCAACA GGGGGCTTGT CTTTAG 

EF028-2 (SEQ ID NO: 106) 

MKKR ALLGVTLLTF TTLAGCTNLS 

EQKSGEKQTE VAEAKATESE KASVKNVIFM IGDGMGNPYT TGYRYFKANH SDKRVPQTAF 
DTYLVGQQAT YPEDEEENVT DSASAATAMA AGVKTYNNAI ALDNDKSKTE TVLERAKKVG 
KSTGLVATSE ITHATPAAYG AHNVSRKNMA EIADDYFDDQ IDGQHKVDVL LGGGSELFAR 
KDRDLVKEFS QAGYGHVTDK KSLNENQDDK ILGLFAPGGL PKMIDRTEEV PSLADMTEAA 
LQRLDKNEKG FFLMVEGSQI DWAGHSNDIV GAMSEMQDFE AAFEKAIDFA KKDGEHWWLQ 
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LQIIQQGACL 



EF028-3 (SEQ ID NO: 107) 



ACAGA AAAGCGGCGA AAAACAAACA GAGGTTGCTG AAGCGAAGGC AACTGAATCT 
GAAAAAGCAT CAGTAAAAAA TGTTATTTTT ATGATTGGAG ATGGCATGGG GAATCCGTAT 
ACAACGGGCT ATCGCTATTT CAAAGCCAAT CACTCAGACA AGCGTGTTCC CCAAACAGCT 
TTTGATACCT ATTTGGTCGG ACAGCAAGCC ACTTATCCAG AAGATGAAGA AGAGAATGTC 
ACCGATTCAG CTTCCGCAGC GACAGCGATG GCTGCCGGAG TGAAAACCTA TAATAATGCT 
ATTGCACTCG ATAATGACAA GTCCAAAACA GAAACAGTGC TCGAACGTGC GAAAAAAGTG 
GGGAAATCAA CGGGTCTTGT AGCAACATCT GAAATAACAC ATGCAACCCC TGCTGCATAT 
GGCGCACATA ATGTTTCACG CAAAAATATG GCAGAAATCG CCGATGACTA TTTTGATGAT 
CAAATCGACG GACAACACAA AGTCGATGTG TTACTTGGCG GCGGCTCCGA ATTATTTGCC 
CGGAAAGATC GTGATTTAGT CAAAGAATTT - TCCCAAGCGG GTTATGGTCA TGTCACAGAC 
AAAAAGTCGT TAAATGAGAA CCAAGACGAC AAAATTTTAG GCTTGTTTGC ACCAGGCGGG 
CTACCTAAAA TGATTGACCG AACGGAAGAA GTCCCTTCAT TAGCTGATAT GACAGAAGCG 
GCTCTTCAAC GGTTAGATAA AAATGAAAAA GGTTTCTTTT TAATGGTTGA AGGTAGTCAA 
ATTGATTGGG CCGGGCATAG CAATGATATT GTTGGCGCGA TGAGCGAAAT GCAAGACTTC 
GAAGCGGCGT TTGAAAAGGC CATCGATTTT GCCAAAAAAG ATGGTGAACA TTGGTGGTTA 
CAACTGCAGA TCATTCAACA GGGGGCTTGT CTT 



EF028-4 (SEQ ID NO: 108) 



QKSGEKQTE VAEAKATESE KASVKNVIFM IGDGMGNPYT TGYRYFKANH SDKRVPQTAF 
DTYLVGQQAT YPEDEEENVT DSASAATAMA AGVKTYNNAI ALDNDKSKTE TVLERAKKVG 
KSTGLVATSE ITHATPAAYG AHNVSRKNMA EIADDYFDDQ IDGQHKVDVL LGGGSELFAR 
KDRDLVKEFS QAGYGHVTDK KSLNENQDDK ILGLFAPGGL PKMIDRTEEV PSLADMTEAA 
LQRLDKNEKG FFLMVEGSQI DWAGHSNDIV GAMSEMQDFE AAFEKAIDFA KKDGEHWWLQ 
LQIIQQGACL 

EF029-1 (SEQ ID NO: 109) 

TGAAGGAGGG AGAAAATGAA AAAGTTAATC GGTAAAAAGT GGCTGCTGCT TACAGCAGTA 
GCCACTTTTT TATTATCAGG ATGCGCAAGT CTTGAACAAA AAGCACAGGA TAGTGTAAAA 
GAAGTTACTG AAAATGTTAC TCAAACTATT TCAAACGATC AACGTATACC AGCTGATTTT 
GTTAGGCACG TGGATGGCGA TACCACAGTA TTAAAAATTG ACGGAAAAGA ACAAAAAGTT 
CGGTTTTTAT TAATTGACAC ACCCGAGACT GTGAAACCGA AAACAAAAGT TCAGCCGTTC 
GGATTGGAAG CTAGCAAACG CACAAAAGAG CTTTTGTCTA CTGCTTCAGA AATTACGTTT 
GAATATGATA AGGGCGATAA AACAGATCGT TACGGACGAG CGTTGGGCTA CATATTCGTA 
GATGGAACAT TACTACAAAA AACGCTTGTA AGTGAAGGAT TAGCTCGTGT TGCCTATGTA 
AAAGAGCCTA CAACTAAGTA TTTGGCAGAA CTAGAGCAAG CCCAAGAACA GGCTAAAAAT 
GAGTCACTCG GAATCTGGAG CATACCAGGT TATGTGACAC AACGGGGGTT TAGTAAATAA 



EF029-2 (SEQ ID NO: 110) 

MKKLIG KKWLLLTAVA TFLLSGCASL EQKAQDSVKE VTENVTQTIS NDQRIPADFV 
RHVDGDTTVL KIDGKEQKVR FLLIDTPETV KPKTKVQPFG LEASKRTKEL LSTASEITFE 
YDKGDKTDRY GRALGYIFVD GTLLQKTLVS EGLARVAYVK EPTTKYLAEL EQAQEQAKNE 
SLGIWSIPGY VTQRGFSK 



EF029-3 (SEQ ID NO: 111) 



AAATGTTAC TCAAACTATT TCAAACGATC AACGTATACC AGCTGATTTT 

GTTAGGCACG TGGATGGCGA TACCACAGTA TTAAAAATTTG ACGGAAAAGA ACAAAAAGTT 
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CGGTTTTTAT TAATTGACAC ACCCGAGACT GTGAAACCGA AAACAAAAGT TCAGCCGTTC 
GGATTGGAAG CTAGCAAACG CACAAAAGAG CTTTTGTCTA CTGCTTCAGA AATTACGTTT 
GAATATGATA AGGGCGATAA AACAGATCGT TACGGACGAG CGTTGGGCTA CATATTCGTA 
GATGGAACAT TACTACAAAA AACGCTTGTA AGTGAAGGAT TAGCTCGTGT TGCCTATCTA 
AAAGAGCCTA CAACTAAGTA TTTGGCAGAA CTAGAGCAAG CCCAAGAACA GGCTAAAAAT 
GAGTCACTCG GAATCTGGAG CATACCAGGT TATGTGACAC AACGGGGGTT TAGTAAA 



EF029-4 (SEQ ID NO: 112) 

NVTQTIS NDQRIPADFV 
RHVDGDTTVL KIDGKEQKVR FLLIDTPETV 
YDKGDKTDRY GRALGYIFVD GTLLQKTLVS 
SLGIWSIPGY VTQRGFSK 

EF030-1 (SEQ ID NO: 113) 



KPKTKVQPFG LEASKRTKEL LSTASEITFE 
EGLARVAYVK EPTTKYLAEL EQAQEQAKNE 



TGATTGACAC ATAGGGGGAA TAGTATGAAA AAGTTAAAAA TGATGGGGAT TATGTTATTT 
GTTAGTACGG TCTTGGTAGG TTGTGGCACA ACAGCAGANA CAAAAATAGA CGAGAAAGCA 
ACTGAGAAAA CCAGTGTCTC GAAAAAAGTT TTAAATTTAA TCGAGAACTC GGAAATCGGT 
TCAATGGATT CTATTTTTAC ACAAGATGAA GCCAGTATTA ACGCACAGTC CAATGTCTTT 
GAAGGGTTAT ATCAATTGGA TGAAAAAGAT CAACTAATAC CTGCTGCTGC TAAAGAGATG 
CCAGAAATTT CTGAGGATGG CAAACGATAT ACCATTAAAC TAAGAGAAGA TGGCAAGTGG 
TCCAATGGTG ATGCTGTAAC AGCCAATGAT TTCGTTTTTG CTTGGCGTAA ATTAGCGAAT 
CCCAAAAACC AAGCCAATTA CTTTTTCTTG TTAGAAGGAA CGATTCTGAA CGGAACAGCT 
ATTACAAAAG AGGAAAAAGC ACCAGAGGAA TTGGGTGTCA AAGCGCTTGA TGATTATACT 
TTGGAGGTTA CTTTAGAAAA GCCTGTACCA TATTTTACGT CGTTATTGGC ATTTTCTCCA 
TTTTTCCCAC AAAACGAAGC ATTCGTGAAA GAAAAAGGAC AAGCCTATGG CACTTCTAGT 
GAAATGATTG TATCTAATGG TCCGTTTTTA ATGAAAAATT GGGATCAGTC AGCGATGTCG 
TGGGATTTTG TGCGTAATCC CTACTATTAC GATAAAGAAA AAGTAAAATC AGAAACGATT 
CATTTTGAAG TTCTTAAAGA AACCAATACC GTTTATAATT TGTACGAATC AGGTGAATTA 
GATGTGGCTG TCTTAACAGG AGATTTTGCT AAACAAAATC GAGACAACCC AGACTATGAA 
GCAATCGAAC GGTCAAAAGT CTATTCCTTA CGTTTAAACC AAAAAAGAAA CGAAAAACCA 
TCCATTTTTG CAAATGAGAA TGTCCGCAAA GCa?TTAGCTT ATGCTTTGGA TAAAAAAAGT 
TTAGTCGATA ATATTTTAGC AGATGGCTCA AAAGAAATTT ATGGGTACAT TCCAGAAAAA 
TTTGTATATA ACCCAGAAAC GAATGAAGAT TTTCGTCAAG AAGCAGGCGC TCTTGTCAAA 
ACAGACGCCA AAAAAGCCAA AGAGTATTTA GATAAAGCAA AAGCAGAGCT AAACGGAGAT 
GTAGCCATTG AACTTCTTTC AAGAGATGGT GATAGTGACC GA 

EF030-2 (SEQ ID N0:114) 

MKK LKMMGIMLFV STVLVGCGTT AXTKIDEKAT EKTSVSKKVL NLMENSEIGS 
MDSIFTQDEA SINAQSNVFE GLYQLDEKDQ LIPAAAKEMP EISEDGKRYT IKLREDGKWS 
NGDAVTANDF VFAWRKLANP KNQANYFFLL EGTILNGTAI TKEEKAPEEL GVKALDDYTL 
EVTLEKPVPY FTSLLAFSPF FPQNEAFVKE KGQAYGTSSE MIVSNGPFLM KNWDQSAMSW 
DFVRNPYYYD KEKVKSETIH FEVLKETNTV YNLYESGELD VAVLTGDFAK QNRDNPDYEA 
lERSKVYSLR LNQKRNEKPS IFANENVRKA LAYALDKKSL VDNILADGSK EIYGYIPEKF 
VYNPETNEDF RQEAGALVKT DAKKAKEYLD KAKAELNGDV AIELLSRDGD SDR 

EF030-3 (SEQ ID N0:115) 

GAGAAAGCA 

ACTGAGAAAA CCAGTGTCTC GAAAAAAGTT TTAAATTTAA TGGAGAACTC GGAAATCGGT 
TCAATGGATT CTATTTTTAC ACAAGATGAA GCCAGTATTA ACGCACAGTC CAATGTCTTT 
GAAGGGTTAT ATCAATTGGA TGAAAAAGAT CAACTAATAC CTGCTGCTGC TAAAGAGATG 
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CCAGAAATTT CTGAGGATGG CAAACGATAT ACCATTAAAC TAAGAGAAGA TGGCAAGTGG 
TCCAATGGTG ATGCTGTAAC AGCCAATGAT TTCGTTTTTG CTTGGCGTAA ATTAGCGAAT 
CCCAAAAACC AAGCCAATTA CTTTTTCTTG TTAGAAGGAA CGATTCTGAA CGGAACAGCT 
ATTACAAAAG AGGAAAAAGC ACCAGAGGAA TTGGGTGTCA AAGCGCTTGA TGATTATACT 
TTGGAGGTTA CTTTAGAAAA GCCTGTACCA TATTTTACGT CGTTATTGGC ATTTTCTCCA 
TTTTTCCCAC AAAACGAAGC ATTCGTGAAA GAAAAAGGAC AAGCCTATGG CACTTCTAGT 
GAAATGATTG TATCTAATGG TCCGTTTTTA ATGAAAAATT GGGATCAGTC AGCGATGTCG 
TGGGATTTTG TGCGTAATCC CTACTATTAC GATAAAGAAA AAGTAAAATC AGAAACGATT 
CATTTTGAAG TTCTTAAAGA AACCAATACC GTTTATAATT TGTACGAATC AGGTGAATTA 
GATGTGGCTG TCTTAACAGG AGATTTTGCT AAACAAAATC GAGACAACCC AGACTATGAA 
GCAATCGAAC GGTCAAAAGT CTATTCCTTA CGTTTAAACC AAAAAAGAAA CGAAAAACCA 
TCCATTTTTG CAAATGAGAA TGTCCGCAAA GCTTTAGCTT ATGCTTTGGA TAAAAAAAGT 
TTAGTCGATA ATATTTTAGC AGATGGCTCA AAAGAAATTT ATGGGTACAT TCCAGAAAAA 
TTTGTATATA ACCCAGAAAC GAATGAAGAT TTTCGTCAAG AAGCAGGCGC TCTTGTCAAA 
ACAGACGCCA AAAAAGCCAA AGAGTATTTA GATAAAGCAA AAGCAGAGCT AAACGGAGAT 
GTAGCCATTG AACTTCTTTC AAGAGATGGT 

EF030-4 (SEQ ID N0:116) 



EKAT EKTSVSKKVL NLMENSEIGS 

MDSIFTQDEA SINAQSNVFE GLYQLDEKDQ LIPAAAKEMP EISEDGKRYT IKLREDGKWS 
NGDAVTANDF VFAWRKLANP KNQANYFFLL EGTILNGTAI TKEEKAPEEL GVKALDDYTL 
EVTLEKPVPY FTSLLAFSPF FPQNEAFVKE KGQAYGTSSE MIVSNGPFLM KNWDQSAMSW 
DFVRNPYYYD KEKVKSETIH FEVLKETNTV YNLYESGELD VAVLTGDFAK QNRDNPDYEA 
lERSKVYSLR LNQKRNEKPS IFANENVRKA LAYALDKKSL VDNIIiADGSK EIYGYIPEKF 
VYNPETNEDF RQEAGALVKT DAKKAKEYLD KAKAELNGDV AIELLSRDG 

EF031-1 (SEQ ID NO:117) 

TGAGAAATTA GTTATTTTAG AAAAATAAAA ACCATTTTGG AGGAAGATTT AAAAATGAAA 
AAACGCGTAA TTTTAGGGAC ATTAGTCGCT GCAACGTTAT TAATGACTGC TTGTGGAAAC 
AGCGAAGCAA CTACGAAAAG CGAGAGCAAA GGTGGAAGTA ATGCTTTAGT CGTTTCAACT 
TTCGGATTAA GTGAAGATAT TGTCAAAAAA GACATTATCG CTCCATTTGA AAAAGAGAAT 
GAAGCGAAAG TTACCTTAGA AGTAGGCAAT AGCGCAGACC GCTTTACGAA ATTAAAAAAT 
AATCCCAATG CGGGAATTGA TGTCATTGAA TTAGCACAAG CAAATGCAGC ACAAGGTGGA 
AAAGATGGGT TATTTGAAAA AATTACAGAA AAAGAAGTAC CTAATTTAAG TCAGTTAACG 
CCGGGAGCAA AAGAGGTTTT TGAAAGTGGT GCTGGCGTAC CAATCGCTGT AAACAGTATC 
GGGATTGTTT ACAACAAAGA AAAATTAGGC AAAGAAATTA AAAACTGGGA TGACTTATGG 
TCAGCTGATT TGAAAGGTAA AATTTCTGTT CCAGACGTTG CCACGACGGC AGGTCCTTTA 
ATGTTATACG TTGCTAGTGA ACATGCTGGT CAAGATATTA CAAAAGATAA CGGGAAGGCC 
GCTTTTGAAG CGATGAAAGA ATTAAAACCA AACGTTGTTA AAACGTATTC AAAATCGTCA 
GACTTAGCNA ATATGTTCCA ATCTGGTGAA ATTGAAGCAG COXSTGGTTGC TGATTTTGCG 
GTTGATATTA TTCAAGGCGC ACAGAAAACG TGA 

EFO031-2 (SEQ ID NO: 118) 

MKK RVILGTLVAA TLLMTACGNS EATTKSESKG GSNALWSTF 

GLSEDIVKKD IIAPFEKENE AKVTLEVGNS ADRFTKLKNN PNAGIDVIEL AQANAAQGGK 
DGLFEKITEK EVPNLSQLTP GAKEVFESGA GVPIAVNSIG IVYNKEKLGK EIKNWDDLWS 
ADLKGKISVP DVATTAGPLM LYVASEHAGQ DITKDNGKAA FEAMKELKPN WKTYSKSSD 
LANMFQSGEI EAAWADFAV DIIQGAQKT 



EF031-3 (SEQ ID N0:119) 
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AA CTACGAAAAG CGAGAGCAAA GGTGGAAGTA ATGCTTTAGT CGTTTCAACT 
TTCGGATTAA GTGAAGATAT TGTCAAAAAA GACATTATCG CTCCATTTGA AAAAGAGAAT 
GAAGCGAAAG TTACCTTAGA AGTAGGCAAT AGCGCAGACC GCTTTACGAA ATTAAAAAAT 
AATCCCAATG CGGGAATTGA TGTCATTGAA TTAGCACAAG CAAATGCAGC ACAAGGTGGA 
AAAGATGGGT TATTTGAAAA AATTACAGAA AAAGAAGTAC CTAATTTAAG TCAGTTAACG 
CCGGGAGCAA AAGAGGTTTT TGAAAGTGGT GCTGGCGTAC CAATCGCTGT AAACAGTATC 
GGGATTGTTT ACAACAAAGA AAAATTAGGC AAAGAAATTA AAAACTGGGA TGACTTATGG 
TCAGCTGATT TGAAAGGTAA AATTTCTGTT CCAGACGTTG CCACGACGGC AGGTCCTTTA 
ATGTTATACG TTGCTAGTGA ACATGCTGGT CAAGATATTA CAAAAGATAA CGGGAAGGCC 
GCTTTTGAAG CGATGAAAGA ATTAAAACCA AACGTTGTTA AAACGTATTC AAAATCGTCA 
GACTTAGCNA ATATGTTCCA ATCTGGTGAA ATTGAAGCAG CTGTGGTTGC TGATTTTGCG 
GTTGATATTA TTCAAGGCGC ACAGAAAA 

EF031-4 (SEQ ID NO:120) 

TTKSESKG GSNALWSTF 

GLSEDIVKKD IIAPFEKENE AKVTLEVGNS ADRFTKLKNN PNAGIDVIEL AQANAAQGGK 
DGLFEKITEK EVPNLSQLTP GAKEVFESGA GVPIAVNSIG IVYNKEKLGK EIKNWDDLWS 
ADLKGKISVP DVATTAGPLM LYVASEHAGQ DITKDNGKAA FEAMKELKPN WKTYSKSSD 
LANMFQSGEI EAAWADFAV DIIQGAQK 



EF032-1 (SEQ ID NO: 121) 

TGAATAAATT ATTTAGGAGG AATTATGATG 
GTTTGTGGTA TTTCACTACT TACTGCTTGT 
AAGTCAACCA GTCAATCTAG CAGCACAGTT 
TCAGGGGAAT ATTCAGTTGG AAAAGATATT 
CAACTAGATG ATAAATCGAG CATAGTTCTT 
AACCATGACT TATACGGAGT GGGAAACAAG 
CTCACATTCG AAACTGCCGA C7UWVGATTTT 
CAAGAATATA TGAAAAATCC AGTATCNAGT 
TCTGATGTTT CTAAAAGTAG TAGCCAAGAT 
GAAGTAAGTA CTGAAGCGAA GTCTGATGTA 
AATACTAATG ACATTACTAA GCTAGCAGAT 
GATACTTTAG CTAAGCATCA ATTTAATGAT 
TCAATTATCG GCGTCATCCC AACCATGGAC 



AAAAAATTAA TTAGTTTAGG ATTGGTTTGT 
NCGGGAAATA ATGATAATAA AGATACTGAA 
AAACAACCGA ATTCAAAAGA CTTTGTTGCG 
GATCCTGGAG ATTACTATGC TGTATTAACT 
ATTACCGTCA AATCAGGCGG AGAAAATAGT 
AAAAAAGTAT CTCTTAAAAA GGGAGATACT 
GTTGTTAGAT TTTTAAATGA AAAAGATTTT 
ACTGAAACTA GCAAACANAA AACAGTAAAC 
AATAAACAAT CTGATGTATC TGAAAAAAAA 
GCTACTAATA CTTTACCGAG CGAAGATAAA 
GAGCCAACCT TAGAACAACA AACCGTCTTA 
ATGTATCCTT ATAAAGGAAG CAAAATGCAT 
GCAAAAAGAT GGTAA 



EF032-2 {SEQ ID NO: 122) 

MK KLISIiGLVCV CGISLLTACX GNNDNKDTEK STSQSSSTVK QPNSKDFVAS 
GEYSVGKDID PGDYYAVLTQ LDDKSSIVLI TVKSGGENSN HDLYGVGNKK KVSLKKGDTL 
TFETADKDFV VRFLNEKDFQ EYMKNPVSST ETSKXKTVNS DVSKSSSQDN KQSDVSEKKE 
VSTEAKSDVA TNTLPSEDKN TNDITKLADE PTLEQQTVLD TLAKHQFNDM YPYKGSKMHS 
IIGVIPTMDA KRW 



EF032-3 (SEQ ID NO: 123) 
TA ATGATAATAA AGATACTGAA 

AAGTCAACCA GTCAATCTAG CAGCACAGTT AAACAACCGA ATTCAAAAGA CTTTGTTGCG 
TCAGGGGAAT ATTCAGTTGG AAAAGATATT GATCCTGGAG ATTACTATGC TGTATTAACT 



wo 98/50554 



PCT/US98/08959 



113 

TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes. 

CAACTAGATG ATAAATCGAG CATAGTTCTT ATTACCGTCA AATCAGGCGG AGAAAATAGT 
AACCATGACT TATACGGAGT GGGAAACAAG AAAAAAGTAT CTCTTAAAAA GGGAGATACT 
CTCACATTCG AAACTGCCGA CAAAGATTTT GTTGTTAGAT TTTTAAATGA AAAAGATTTT 
CAAGAATATA TGAAAAATCC AGTATCNAGT ACTGAAACTA GCAAACANAA AACAGTAAAC 
TCTGATGTTT CTAAAAGTAG TAGCCAAGAT AATAAACAAT CTGATGTATC TGAAAAAAAA 
GAAGTAAGTA CTGAAGCGAA GTCTGATGTA GCTACTAATA CTTTACCGAG CGAAGATAAA 
AATACTAATG ACATTACTAA GCTAGCAGAT GAGCCAACCT TAGAACAACA AACCGTCTTA 
GATACTTTAG CTAAGCATCA ATTTAATGAT ATGTATCCTT ATAAAGGAAG CAAAATGCAT 
TCAATTATCG GCGTCATCCC AACCATGGAC GCAAAAAGAT GG 



EF032-4 (SEQ ID NO: 124) 

NDNKDTEK STSQSSSTVK QPNSKDFVAS 
GEYSVGKDID PGDYYAVLTQ LDDKSSIVLI 
TFETADKDFV VRFLNEKDFQ EYMKNPVSST 
VSTEAKSDVA TNTLPSEDKN TNDITKLADE 
IIGVIPTMDA KRW 



TVKSGGENSN HDLYGVGNKK KVSLKKGDTL 
ETSKXKTVNS DVSKSSSQDN KQSDVSEKKE 
PTLEQQTVLD TLAKHQFNDM YPYKGSKMHS 



EF033-1 (SEQ ID NO: 125) 

TGACTGCTTT TTTTCTATTG GAGAAAAAAG 
CAAAGGAGGT TCATTTCAGA AAATTTTCCC 
AAAATGAAAA AATTTACTTT AACAATGATG 
GCAGGATGTG GTAAACAGGA AAAGAAAGCA 
TTACCAACCA AAGACCGTAG CGGCAAAGAA 
ATTTCCCTAG TGCCATCAAC AACAGAAGTG 
ATCGCAGTTG ATACTCAAAG TAGTACAATG 
GATATGATGG CTGTCGATGC CGAAAAATTG 
AATGACATCA ATTTAGCTAG CTCAGAAAGT 
ACAGTCGTTA ATATCCCCAC TAGTACAAGC 
ATCGCTGATA GCTTATCTGA ACATGAAAAA 
GAAATCGACG AGTAG 



TGGTTTTTTT GTATTGTTTT GACGTTGAGA 
CAAAATAAAA TAGACGAATG CGAGGATGAA 
ACTTTAGGTT TAGTAGCAAC ACTTGGCTTA 
ACTACCTCTT CTGAAAAAAC AGAAGTAACG 
ATTACTTTAC CCAAAGAAGC AACCAAAATT 
ATTGAAGACT TAGGTAAAAC CGACCAATTA 
ATGACTGATT TAAAAAAATT ACCACAAATG 
ATTGCCTTGA AACCACAAAT TGTTTATGTG 
GTTTGGAAGC AAGTGGAAGA TGCTGGAATT 
ATCAAAGCAA TCAAAGAAGA CGTCCAATTC 
GGACAAAAGT TAATCAAAAC AATGGATCAA 



EF033-2 (SEQ ID NO:126) 
MKKFTLTMMT LGLVATLGLA 

GCGKQEKKAT TSSEKTEVTL PTKDRSGKEI TLPKEATKII SLVPSTTEVI EDLGKTDQLI 
AVDTQSSTMM TDLKKLPQMD MMAVDAEKLI ALKPQIVYVN DINLASSESV WKQVEDAGIT 
WNIPTSTSI KAIKEDVQFI ADSLSEHEKG QKLIKTMDQE IDE 

EF033-3 (SEQ ID NO:127) 

CTCTT CTGAAAAAAC AGAAGTAACG 

TTACCAACCA AAGACCGTAG CGGCAAAGAA ATTACTTTAC CCAAAGAAGC AACCAAAATT 
ATTTCCCTAG TGCCATCAAC AACAGAAGTG ATTGAAGACT TAGGTAAAAC CGACCAATTA 
ATCGCAGTTG ATACTCAAAG TAGTACAATG ATGACTGATT TAAAAAAATT ACCACAAATG 
GATATGATGG CTGTCGATGC CGAAAAATTG ATTGCCTTGA AACCACAAAT TGTTTATGTG 
AATGACATCA ATTTAGCTAG CTCAGAAAGT GTTTGGAAGC AAGTGGAAGA TGCTGGAATT 
ACAGTCGTTA ATATCCCCAC TAGTACAAGC ATCAAAGCAA TCAAAGAAGA CGTCCAATTC 
ATCGCTGATA GCTTATCTGA ACATGAAAAA GGACAAAAGT TAATCAAAAC AATGGATCAA 
GAAATCGACG AGTAG 
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EF033-4 (SEQ ID NO: 128) 

SSEKTEVTL PTKDRSGKEI TLPKEATKII SLVPSTTEVI .EDLGKTDQLI 

AVDTQSSTMM TDLKKLPQMD MMAVDAEKLI ALKPQIVYVN DINLASSESV WKQVEDAGIT 

WNIPTSTSI KAIKEDVQFI ADSLSEHEKG QKLIKTMDQE IDE 

EF034-1 (SEQ ID WO:129) 

TAGGAGGGAG TAATCATGAA AAAAATCGGG TATTTTAGTT GTATTATTTT TTTCATGTTT 
TTGGTAGGTT GTAGTAATAA CAAAAAAGAA AACGGCAATC TTTTGAATGC CAGTTCGTTT 
CCTTTAATAC TCACCACGAT TATTGAAAAA GAAGAAGACC TAACGAAAGG TTCAATTTTT 
TTCAACAAGG ATAAAACCAT GACGCTTGAA AAAGAATATT TAGTTAATCC CAATAATGAA 
GACACAAAAA AAACAAGTAG AACAGAAAAA AAGGTATATA AAAATATTAA AATACAAGAA 
AATAAAGAGA GCTATGAAAT TATAGGTCAA TTGGACAAAA AAACGAAAAA AATAGAGTTT 
AAAAAAGTTG ATGAAGGTAA ACGTATATCT GATGCAGAAG GTAATGTGTA TGGTGATTTT 
GGTGGTAAAT AG 

EF034-2 (SEQ ID NO:130) 

MKKIGY FSCIIFFMFL VGCSNNKKEN GNLLNASSFP LILTTIIEKE EDLTKGSIFF 
NKDKTMTLEK EYLVNPNNED TKKTSRTEKK VYKNIKIQEN KESYEIIGQL DKKTKKIEFK 
KVDEGKRISD AEGNVYGDFG GK 

EF034-3 (SEQ ID N0:131) 

AGAA AACGGCAATC TTTTGAATGC CAGTTCGTTT 

CCTTTAATAC TCACCACGAT TATTGAAAAA GAAGAAGACC TAACGAAAGG TTCAATTTTT 
TTCAACAAGG ATAAAACCAT GACGCTTGAA AAAGAATATT TAGTTAATCC CAATAATGAA 
GACACAAAAA AAACAAGTAG AACAGAAAAA AAGGTATATA AAAATATTAA AATACAAGAA 
AATAAAGAGA GCTATGAAAT TATAGGTCAA TTGGACAAAA AAACGAAAAA AATAGAGTTT 
AAAAAAGTTG ATGAAGGTAA ACGTATATCT GATGCAGAAG GTAATGTGTA TGGTGATTTT 
GGTGGTAAAT AG 

EF034-4 (SEQ ID NO: 132) 

KEN GNLLNASSFP LILTTIIEKE EDLTKGSIFF 

NKDKTMTLEK EYLVNPNNED TKKTSRTEKK VYKNIKIQEN KESYEIIGQL DKKTKKIEFK 
KVDEGKRISD AEGNVYGDFG GK 



EF035-1 (SEQ ID NO:133) 

TAAACGAGAG GTGAGTTTAT GAAAACAAAA 
TTATTCACAA GTTTCCTTTT ACTGAGTGGT 
ACAATTGATC GACAGAAAGA AAAAGTCGAT 
GAAAATTCCA TGGAAAGTTA CGACGAAAAA 
AAAATCGATA CTACTGAGTA A 

EF035-2 (SEQ ID NO:134) 



ATCGGAAAAA CAGTTATCTT GTCAGCATTT 
TGTACCTCGG CTGGCGAAX3A GATGGAAAAA 
AAAACGGTCG ATAAGCAGAA ACATAAAAAT 
GTTGACCGTT CTTTAGATAG TCAAGAAGAC 



MKTKI GKTVILSAFL' FTSFLLLSGC TSAGEEMEKT IDRQKEKVDK TVDKQKHKNE 
NSMESYDEKV DRSLDSQEDK IDTTE 
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EF035-3 (SEQ ID NO: 135) 
GATGGAAAAA 

ACAATTGATC GACAGAAAGA AAAAGTCGAT AAAACGGTCG ATAAGCAGAA ACATAAAAAT 
GAAAATTCCA TGGAAAGTTA CGACGAAAAA GTTGACCGTT CTTTAGATAG TCAAGAAGAC 
AAAATCGATA CTACTGAG 

EF03 5-4 (SEQ ID NO: 13 6) 



MEKT IDRQKEKVDK TVDKQKHKNE 
NSMESYDEKV DRSLDSQEDK IDTTE 



EF036-1 (SEQ ID NO:137) 

TAATTTTCAA GTCCTACATA TAATGGTAAA ATAGAATGGA TTGAAATTAA TTGGAGGAAT 
AATGAATCGA TGAAAAAAAG ATTGCTATTA TTTATTGGTT TGGCAAGTAT ACTTACTTTG 
ACAGGATGTG CAAAATGGAT TGATCGTGGT GAATCCATCA CAGCGGTAGG CTCATCAGCT 
TTACAACCAT TAGTAGAGAC AGCGAGTGAG GAATATCAAA GCCAAAATCC GGGAAGATTT 
ATTAATGTCC AAGGTGGCGG AAGCGGAACA GGTCTGAGTC AAGTCCAATC TGGCGCGGTA 
GACATTGGTA ATTCTGATTT ATTTGCAGAA GAGAAAAAGG GCATCAAAGC GGAAGACTTA 
ATTGATCATA AAGTTGCTGT CGTTGGGATT ACACCAATCG TTAACAAAAA TGTCGGTGTC 
AAAGATATCT CAATGGAAAA TTTAAAGAAA ATCTTTTTAG GTGAAGTAAC AAACTGGAAA 
GAACTTGGCG GGAAAGACCA AAAAATTGTT ATTTTGAATA GAGCGGCCGG TAGTGGTACG 
CGTGCGACTT TTGAAAAGTG GGTCTTGGGA GATAAAACAG CCATTCGTGC GCAAGAACAA 
GATTCCAGCG GCATGGTTCG TTCCATTGTT TCTGATACAC CAGGAGCGAT TAGTTATACC 
GCATTTTCAT ATGTTACTGA TGAAGTAGCT ACGTTAAGTA TTGATGGTGT TCAGCCAACA 
GATGAAAATG TAATGAACAA TAAATGGATT ATTTGGTCTT ATGAACACAT GTACACTCGT 
AAAAATCCAA GTGATTTAAC CAAAGAGTTT TTAGACTTTA TGTTGTCAGA TGATATCCAA 
GAACGTGTGA TTGGTCAATT AGGGTATATT CCTGTTTCGA AAATGGAAAT TGAACGGGAT 
TGGCAAGGAA ATGTCATTAA ATAA 

EF-36-2 (SEQ ID NO: 138) 

MKKRLLLF IGLASILTLT GCAKWIDRGE SITAVGSSAL 

QPLVETASEE YQSQNPGRFI NVQGGGSGTG LSQVQSGAVD IGNSDLFAEE KKGIKAEDLI 
DHKVAWGIT PIVNKNVGVK DISMENLKKI FLGEVTNWKE LGGKDQKIVI LNRAAGSGTR 
ATFEKWVLGD KTAIRAQEQD SSGMVRSIVS DTPGAISYTA FSYVTDEVAT LSIDGVQPTD 
ENVMNNKWII WSYEHMYTRK NPSDLTKEFL DFMLSDDIQE RVIGQLGYIP VSKMEIERDW 
QGNVIK 

EF03 6-3 (SEQ ID NO: 13 9) 

GAT TGATCGTGGT GAATCCATCA CAGCGGTAGG CTCATCAGCT 

TTACAACCAT TAGTAGAGAC AGCGAGTGAG GAATATCAAA GCCAAAATCC GGGAAGATTT 
ATTAATGTCC AAGGTGGCGG AAGCGGAACA GGTCTGAGTC AAGTCCAATC TGGCGCGGTA 
GACATTGGTA ATTCTGATTT ATTTGCAGAA GAGAAAAAGG GCATCAAAGC GGAAGACTTA 
ATTGATCATA AAGTTGCTGT CGTTGGGATT ACACCAATCG TTAACAAAAA TGTCGGTGTC 
AAAGATATCT CAATGGAAAA TTTAAAGAAA ATCTTTTTAG GTGAAGTAAC AAACTGGAAA 
GAACTTGGCG GGAAAGACCA AAAAATTGTT ATTTTGAATA GAGCGGCCGG TAGTGGTACG 
CGTGCGACTT TTGAAAAGTG GGTCTTGGGA GATAAAACAG CCATTCGTGC GCAAGAACAA 
GATTCCAGCG GCATGGTTCG TTCCATTGTT TCTGATACAC CAGGAGCGAT TAGTTATACC 
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GCATTTTCAT ATGTTACTGA TGAAGTAGCT 
GATGAAAATG TAATGAACAA TAAATGGATT 
AAAAATCCAA GTGATTTAAC CAAAGAGTTT 
GAACGTGTGA TTGGTCAATT AGGGTATATT 
TGGCAAGGAA ATGTCATTAA A 

EF036-4 (SEQ ID NO: 140) 



ACGTTAAGTA TTGATGGTGT TCAGCCAACA 
ATTTGGTCTT ATGAACACAT GTACACTCGT 
TTAGACTTTA TGTTGTCAGA TGATATCCAA 
CCTGTTTCGA AAATGGAAAT ■TGAAC<3GGAT 



IDRGE SITAVGSSAL 
QPLVETASEE YQSQNPGRFI 
DHKVAWGIT PIVNKNVGVK 
ATFEKWVLGD KTAIRAQEQD 
ENVMNNKWII WSYEHMYTRK 
QGNVIK 



NVQGGGSGTG LSQVQSGAVD 
DISMENLKKI FLGEVTNWKE 
SSGMVRSIVS DTPGAISYTA 
NPSDLTKEFL DFMLSDDIQE 



IGNSDLFAEE KKGIKAEDLI 
LGGKDQKIVI LNRAAGSGTR 
FSYVTDEVAT LSIDGVQPTD 
RVIGQLGYIP VSKMEIERDW 



EF037-1 (SEQ ID NO: 141) 



TGAGTGTATG ATTACTCATT TCCCTTTGAA TCAGTTATGA TAAAGGAAGA AATAAATAAA 
TTTTTTGGAG GGATTTTCAT GAAAATGTCT AAAGTACTCA CCACTGTTTT GACGGCAACT 
GCTGCTCTTG TGTTGCTTAG TGCTTGTTCA TCTGATAAAA AAACAGATAG TAGTTCTAGT 
AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG 
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC 
AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT 
TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT 
GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT 
CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC 
AAAGTTAATG GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC TGACTTAOAA 
AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA ATAA 

EF037-2 (SEQ ID NO: 142) 

MKMSK VLTTVLTATA ALVLLSACSS DKKTDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN 



EF037-3 (SEQ ID NO: 143) 



AACAGATAG TAGTTCTAGT 
AGCAAAGAAA CAGCTAATTC AAGTACAGAA 
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT 
AATGTCACTT TTGATAAAGA GGTAACAGTT 
TCCAACGATG TCTATCGTAA ATTAGCACTT 
GCTGAATATG AAATCACGGT TCCTAAGCTA 
CACGGGACTG TCAAAGGTGA TATTGAGGTG 
AAAGTTAATG GCAATATTAC TTTTGATAAA 
AAAGATGGTG CCACTGTTAC TGGTGAAGTC 

EF037-4 (SEQ ID NO: 144) 



GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG 
AAAGGAAATT GGATTGTCGC AGCTACTGAC 
GCTGGTACTT TCCATGATAA GGGGAAAGAT 
TATTCCCAAG ATGATAATAA AAAAGTAACT 
ATCGTTTCTT CTGAAAATTT CAACATCGTT 
AAAGCAAATG GCTTTACTTT AAATGGTACC 
CAAGAATACA AAGATTCTGC TGACTTAGAA 
ACCGTAGCCA A 



TDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
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EF038-1 (SEQ ID NO: 145) 

TAATGGCCAT TTCGTCTACT AATAAAGAGG ATGAAGCTAC TCAAATGGCG TTGGCAATGG 
AACAAGGATC ATAAAAAAGG AGAAGTGAGC ATGAAAAAAG TACTACCTTT TATTGCCTTA 
GTCGGCTTGT TATTGTTGTC AGGTTGTGGA ACAGATATGA AAAAGATATT GACTGCCGAT 
GGTGGTAAAT GGGAACTAGA AAATAAAAGT CCAACTACTA CTTACACTTT TTTTGATGAT 
GAAACTTTTT CGAGGTATAA TTCAAAAATT AGTGATAGTG GAACGTACTC TTACGATGAA 
AATAATAAAA AACTCACTTT GGATATAAAA AATAAAGAAC AATTAATAAT GGAAAATGTT 
GAATATAAAG ACGGTAAATT AAAAGGTGAA ATTGGAGGCG AGAAGGACTC TGATAAAAAA 
TNGAATAAGA GGTGTCTTTG A 

EF038-2 (SEQ ID NO: 146) 

M KLLKWRWQWN KDHKKGEVSM KKVLPFIALV GLLLLSGCGT DMKKILTADG 
GKWELENKSP TTTYTFFDDE TFSRYNSKIS DSGTYSYDEN NKKLTLDIKN KEQLIMENVE 
YKDGKLKGEI GGEKDSDKKX NKRCL 

EF038-3 (SEQ ID NO: 147) 

TTGTGGA ACAGATATGA AAAAGATATT GAG^TGCCGAT 

GGTGGTAAAT GGGAACTAGA AAATAAAAGT CCAACTACTA CTTACACTTT TTTTGATGAT 
GAAACTTTTT CGAGGTATAA TTCAAAAATT AGTGATAGTG GAACGTACTC TTACGATGAA 
AATAATAAAA AACTCACTTT GGATATAAAA AATAAAGAAC AATTAATAAT <X;AAAATGTT 
GAATATAAAG ACGGTAAATT AAAAGGTGAA ATTGGAGGCG AGAAGGACTC TGATAAAAAA 
TNGAATAAGA GGTGTCTTTG A 



EF038-4 (SEQ ID NO: 148) 
CGT DMKKILTADG 

GKWELENKSP TTTYTFFDDE TFSRYNSKIS 
YKDGKLKGEI GGEKDSDKKX NKRCL 



DSGTYSYDEN NKKLTLDIKN KEQLIMENVE 



EF039-1 (SEQ ID NO: 149) 

TAAATATATC AAAAAGAAAA AAGGGGATTA CCAACCATGA AAAAGAAAAA AGTTTTTAGT 
GCGCTTACCT TATTAACCTT TAGTACGTTG TTGATTGCAG GCTGTGCTGG CGGAGCCAAC 
TCTGCAACAG ATAAATCAAG TGCAGCTAGC TCAAGCACTG CAGTCTCTAG TTCAGCAGAA 
GCAGCTAAAG AGCAATCAAA AGGACAAGAA TTAACAGAAA TTTTATCCAG TACTGATTGG 
CAAGGCACAA AAGTTTACGA CAAAAATNAT AATAATTTAA CAGCAGAAAA TGCTAATTTT 
ATTGGTTTAG CAAAATATGA TGGTGAAACA GGTTTTTATG AATTTTTCGA CAAAGAAACA 
GGTGAAACCC GTGGCGATGA AGGCACATTC TTTGTGACAG ACGATGGCGA AAAGCGTATC 
TTAATTTCGG ATACACAAAA CTATCAAGCG GTGGTCGATT TAACGGAAGT GACGAAAGAT 
AAATTTACCT ATAAGCGAAT GGGTAAAGAT AAAGACGGGA AAGATGTAGA AGTCTTTGTA 
GAACATATCC CTTATTCTGA CGAGAAATTA ACCTTTACGA ACGGCCGTAA AGATTTAGAA 
ACAGAAACTG GCAAGATTGT TACCAATGAA CCTGGGGATG ACATTTTAGG GGCCACATTA 
TGGAATGGCA CGAAAGTTTT AGATGAAGAC GGTAACGATG TTACTGAAGC AAATAAAATG 
TTTATTAGTT TAGCGAAATT TGATAATAAA ACAAGTAAAT ATGAATTCTT TGATTTAGAA 
ACGGGTAAAA CACGTGGAGA TTTTGGTTAC TTCCAAGTAA TTGATAATAA CAAAATCCGT 
GCTCACGTTT CAATTGGTGA CAATAAATAT GGAGCTGCAT TAGAATTAAC AGAATTAAAT 
GATAAACGTT TTACGTATAC ACGAATGGGT AAAGACAACA ATGGCAAAGA AATTAAAGTC 
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TTTGTAGAAC ATGAACCATA TGAAGGAGAC TTTACGCCAG ACTTCACGTT CTAA 
EF039-2 (SEQ ID NO: 150) 

MKKKKVFSA LTLLTFSTLL lAGCAGGANS ATDKSSAASS STAVSSSAEA 
AKEQSKGQEL TEILSSTDWQ GTKVYDKNXN NLTAENANFI GLAKYDGETG FYEFFDKETG 
ETRGDEGTFF VTDDGEKRIL ISDTQNYQAV VDLTEVTKDK FTYKRMGKDK DGKDVEVFVE 
HIPYSDEKLT FTNGRKDLET ETGKIVTNEP GDDILGATLW NGTKVLDEDG NDVTEANKMF 
ISLAKFDNKT SKYEFFDLET GKTRGDFGYF QVIDNNKIRA HVSIGDNKYG AALELTELND 
KRFTYTRMGK DNNGKEIKVF VEHEPYEGDF TPDFTF 

EF039-3 (SEQ ID N0:151) 

TGCAACAG ATAAATCAAG TGCAGCTAGC TCAAGCACTG CAGTCTCTAG TTCAGCAGAA 
GCAGCTAAAG AGCAATCAAA AGGACAAGAA TTAACAGAAA TTTTATCCAG TACTGATTGG 
CAAGGCACAA AAGTTTACGA CAAAAATNAT AATAATTTAA CAGCAGAAAA TGCTAATTTT 
ATTGGTTTAG CAAAATATGA TGGTGAAACA GGTTTTTATG AATTTTTCGA CAAAGAAACA 
GGTGAAACCC GTGGCGATGA AGGCACATTC TTTGTGACAG ACGATGGCGA AAAGCGTATC 
TTAATTTCGG ATACACAAAA CTATCAAGCG GTGGTCGATT TAACGGAAGT GACGAAAGAT 
AAATTTACCT ATAAGCGAAT GGGTAAAGAT AAAGACGGGA AAGATGTAGA AGTCTTTGTA 
GAACATATCC CTTATTCTGA CGAGAAATTA ACCTTTACGA ACGGCCGTAA AGATTTAGAA 
ACAGAAACTG GCAAGATTGT TACCAATGAA CCTGGGGATG ACATTTTAGG GGCCACATTA 
TGGAATGGCA CGAAAGTTTT AGATGAAGAC GGTAACGATG TTACTGAAGC AAATAAAATG 
TTTATTAGTT TAGCGAAATT TGATAATAAA ACAAGTAAAT ATGAATTCTT TGATTTAGAA 
ACGGGTAAAA CACGTGGAGA TTTTGGTTAC TTCCAAGTAA TTGATAATAA CAAAATCCGT 
GCTCACGTTT CAATTGGTGA CAATAAATAT GGAGCTGCAT TAGAATTAAC AGAATTAAAT 
GATAAACGTT TTACGTATAC ACGAATGGGT AAAGACAACA ATGGCAAAGA AATTAAAGTC 
TTTGTAGAAC ATGAACCATA TGAAGGAGAC TTTACGCCAG ACTTCACGTT CTAA 

EF039-4 (SEQ ID NO: 152) 

ATDKSSAASS STAVSSSAEA 
AKEQSKGQEL TEILSSTDWQ GTKVYDKNXN NLTAENANFI 
ETRGDEGTFF VTDDGEKRIL ISDTQNYQAV VDLTEVTKDK 
HIPYSDEKLT FTNGRKDLET ETGKIVTNEP GDDILGATLW 
ISLAKFDNKT SKYEFFDLET GKTRGDFGYF QVIDNNKIRA 
KRFTYTRMGK DNNGKEIKVF VEHEPYEGDF TPDFTF 

EF040-1 (SEQ ID NO: 153) 

TAGATTAGAA CCACTGGAGA AAAATCTCAT ATTTCTCTCG AGGAAAGGAA GTTGAGCACA 
ATGAACAAAA AAATTTTAAT GGGGCTATTA AGTGTCGTGA CCATTCCATT ACTTGCTGCG 
TGTCAAGGAG GAGAAACACC TTCCGCAGCG TCAAAAAATA GTCAAACGGT GACTACTCAA 
AGTAGTGCAA AAACTGAAAG CACCAGTACA ACCCGTTCGG TAGCTCAAAC AACATCAAAA 
GAGGAAGTGA AAGAACCGAT GAAGACCTAT GAAGTGGGTG CGCTTTTAGA AGCAGCCAAT 
CAACGAGATA CGAAGAAGGT CAAGGAAATT TTACAAGATA CTACTTATCA AGTGGATGAA 
GTCGACACAG AAGGCAACAC ACCGCTCAAT ATCGCTGTTC ACAATAATGA CATTGAGATT 
GCAAAAGCGT TGATTGATCG GGGTGCCGAT ATTAATCTGC AAAACAGCAT TAGTGATAGT 
CCCTATCTTT ATGCGGGAGC GCAAGGACGT ACGGAGATTT TAGCGTATAT GTTAAAACAT 
GCGACCCCAG ATTTAAATAA GCATAACCGT TACGGTGGCA ATGCGTTAAT TCCGGCAGCT 
GAAAAAGGAC ATATTGACAA TGTGAAGCTC TTGTTAGAAG ATGGACGAGA AGACATAGAT 
TTCCAAAATG ACTTTGGCTA TACAGCATTG ATTGAGGCAG TGGGGTTACG TGAAGGGAAC 
CAACTTTACC AAGATATTGT AAAATTGTTA ATGGAAAATG GTGCGGATCA ATCCATTAAA 
GACAATTCTG GTCGAACAGC AATGGACTAT GCCAATCAAA AAGGTTATAC GGAAATTAGT 



GLAKYDGETG FYEFFDKETG 
FTYKRMGKDK DGKDVEVFVE 
NGTKVLDEDG NDVTEANKMF 
HVSIGDNKYG AALELTELND 
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AAAATTTTAG CACAGTACAA CTAA 
EF040-2 (SEQ ID NO: 154) 



M NKKILMGLLS WTIPLLAAC QGGETPSAAS KNSQTVTTQS 

SAKTESTSTT RSVAQTTSKE EVKEPMKTYE VGALLEAANQ RDTKKVKEIL QDTTYQVDEV 
DTEGNTPLNI AVHNNDIEIA KALIDRGADI NLQNSISDSP YLYAGAQGRT EILAYMLKHA 
TPDLNKHNRY GGNALIPAAE KGHIDNVKLL LEDGREDIDF QNDFGYTALI EAVGLREGNQ 
LYQDIVKLLM ENGADQSIKD NSGRTAMDYA NQKGYTEISK ILAQYN 



EF040-3 (SEQ ID NO:155) 



AGCG TCAAAAAATA GTCAAi 
AGTAGTGCAA AAACTGAAAG 
GAGGAAGTGA AAGAACCGAT 
CAACGAGATA CGAAGAAGGT 
GTCGACACAG AAGGCAACAC 
GCAAAAGCGT TGATTGATCG 
CCCTATCTTT ATGCGGGAGC 
GCGACCCCAG ATTTAAATAA 
GAAAAAGGAC ATATTGACAA 
TTCCAAAATG ACTTTGGCTA 
CAACTTTACC AAGATATTGT 
GACAATTCTG GTCGAACAGC 
AAAATTTTAG CACAGTACAA 

EF040-4 (SEQ ID N0:15i 



iCGGT GACTACTCAA 
CACCAGTACA ACCCGTTCGG 
GAAGACCTAT GAAGTGGGTG 
CAAGGAAATT TTACAAGATA 
ACCGCTCAAT ATCGCTGTTC 
GGGTGCCGAT ATTAATCTGC 
GCAAGGACGT ACGGAGATTT 
GCATAACCGT TACGGTGGCA 
TGTGAAGCTC TTGTTAGAAG 
TACAGCATTG ATTGAGGCAG 
AAAATTGTTA ATGGAAAATC 
AATGGACTAT GCCAATCAAA 
C 

i) 



TAGCTCAAAC AACATCAAAA 
CGCTTTTAGA AGCAGCCAAT 
CTACTTATCA AGTGGATGAA 
ACAATAATGA CATTGAGATT 
AAAACAGCAT TAGTGATAGT 
TAGCGTATAT GTTAAAACAT 
ATGCGTTAAT TCCGGCAGCT 
ATGGACGAGA AGACATAGAT 
TGGGGTTACG TGAAGGGAAC 
GTGCGGATCA ATCCATTAAA 
AAGGTTATAC GGAAATTAGT 



AS KNSQTVTTQS 
SAKTESTSTT RSVAQTTSKE 
DTEGNTPLNI AVHNNDIEIA 
TPDLNKHNRY GGNALIPAAE 
LYQDIVKLLM ENGADQSIKD 



EVKEPMKTYE VGALLEAANQ 
KALIDRGADI NLQNSISDSP 
KGHIDNVKLL LEDGREDIDF 
NSGRTAMDYA NQKGYTEISK 



RDTKKVKEIL QDTTYQVDEV 
YLYAGAQGRT EILAYMLKHA 
QNDFGYTALI EAVGLREGNQ 
ILAQYN 



EF041-1 (SEQ ID NO:157) 

TAATTATTAA NTTCTGATTT TTCAGAAAAT 
ATGAAATTGA AAAAGTCATT AACATTCGGT 
GCGGCTTGTG GAGGCGGCGG AACGTCAGAT 
AGTGGCGAAC AAGTTTTACG TGTCACAGAA 
CTAGCAACAG NCAGAATTAG TTTTATTGCA 
TTAGACAAAG ATAACAAAGT CCAACCTGCA 
GATGGACTAA CATACAAAAT TAAATTAAAT 
GTGACTGCTA ATGACTATGT TTACGGATGG 
GAATATGCTT ATCTGTATGC . CTCTGTAAAA 
GATAAATCAG AATTAGGAAT TAAAGCAGTC 
AAAGCAACAC CATACTTTGA TTACTTATTA 
GACATTGTGG AAAAATATGG TAAAAATTAT 
GGTCCATTCG TCTTAGACGG CTTTGATGGT 
AAAAACGATC AATATTGGGA TAAAGATACT 
GTGAAAGAAT CACCAACCGC GTTGAACTTG 
CTTTCTGGTG AATTAGCCCA ACAAATGGCC 
GCATCAACAC AATATATGGA ACTAAATCAA 



ACAGATTGCA TTATTTTAGG AGGCAACACT 
GTGATTACAT TATTTAGCGT AACAACTTTA 
AGCTCAAGCG CGTCTGGTGG CGGTAAGGCA 
CAACAAGAAA TGCCAACAGC TGATTTATCA 
TTAAATAATG TATATGAAGG AATTTATCGT 
GGTGCAGCGG AAAAAGCAGA AGTTTCTGAA 
AAAGATGCAA AATGGTCAGA CGGTAAACCA 
CAACGAACAG TTGATCCAGC GACAGCTTCT. 
AATGGTGATG CCATTGCTAA AGGGGAAAAA 
AGTGATACAG AATTAGAAAT CACTTTAGAA 
GCTTTCCCAT CATTCTTCCC GCAAGGTCAA 
GCATCAAACA GCGAAAGTGC TGTCTACAAT 
CCTGGTACAG ATACAAAATG <3TCATTCAAG 
GTGAAACTGG ACTCAGTAGA TGTGAATGTC 
TTCCAAGATG GACAAACAGA CGATGTCGTT 
AATGACCCAG CTTTTGTTAG TCAAAAAGAA 
CGTGATGAAA AATCACCATT TAGAAATGCG 
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AACTTACGTA AAGCAATTTC TTACTCAATC GACCGTAAAG CGTTAGTTGA ATCAATCCTT 
AGGGGATGG 



EF041-2 (SEQ ID NO: 158) 

M KLKKSLTFGV ITLFSVTTLA . ACGGGGTSDS SSASGGGKAS 

GEQVLRVTEQ QEMPTADLSL ATXRISFIAL NNVYEGIYRL DKDNKVQPAG AAEKAEVSED 
GLTYKIKLNK DAKWSDGKPV TANDYVYGWQ RTVDPATASE YAYLYASVKN <3DAIAKGEKD 
KSELGIKAVS DTELEITLEK ATPYFDYLLA FPSFFPQRQD IVEKYGKNYA SNSESAVYNG 
PFVLDGFDGP GTDTKWSFKK NDQYWDKDTV KLDSVDVNW KESPTALNLF QDGQTDDWL 
SGELAQQMAN DPAFVSQKEA STQYMELNQR DEKSPFRNAN LRKAISYSID RKALVESILR 
GW 



EF041-3 (SEQ ID NO:159) 

TTGTG GAGGCGGCGG AACGTCAGAT AGCTCAAGCG CGTCTGGTGG CGGTAAGGCA 
AGTGGCGAAC AAGTTTTACG TGTCACAGAA CAACAAGAAA TGCCAACAGC TGATTTATCA 
CTAGCAACAG NCAGAATTAG TTTTATTGCA TTAAATAATG TATATGAAGG AATTTATCGT 
TTAGACAAAG ATAACAAAGT CCAACCTGCA GGTGCAGCGG AAAAAGCAGA AGTTTCTGAA 
GATGGACTAA CATACAAAAT TAAATTAAAT AAAGATGCAA AATGGTCAGA CGGTAAACCA 
GTGACTGCTA ATGACTATGT TTACGGATGG CAACGAACAG TTGATCCAGC GACAGCTTCT 
GAATATGCTT ATCTGTATGC CTCTGTAAAA AATGGTGATG CCATTGCTAA AGGGGAAAAA 
GATAAATCAG AATTAGGAAT TAAAGCAGTC AGTGATACAG AATTAGAAAT CACTTTAGAA 
AAAGCAACAC CATACTTTGA TTACTTATTA GCTTTCCCAT CATTCTTCCC GCAACGTCAA 
GACATTGTGG AAAAATATGG TAAAAATTAT GCATCAAACA GCX3AAAGTGC TGTCTACAAT 
GGTCCATTCG TCTTAGACGG CTTTGATGGT CCTGGTACAG ATACAAAATG GTCATTCAAG 
AAAAACGATC AATATTGGGA TAAAGATACT GTGAAACTGG ACTCAGTAGA TGTGAATGTC 
GTGAAAGAAT CACCAACCGC GTTGAACTTG TTCCAAGATG GACAAACAGA CGATGTCGTT 
CTTTCTGGTG AATTAGCCCA ACAAATGGCC AATGACCCAG CTTTTGTTAG TCAAAAAGAA 
GCATCAACAC AATATATGGA ACTAAATCAA CGTGATGAAA AATCACCATT TAGAAATGCG 
AACTTACGTA AAGCAATTTC TTACTCAATC GACCGTAAAG CGTTAGTTGA ATCAATCCTT 
AGGGGATGG 

EF041-4 (SEQ ID NO:160) 
CGGGGTSDS SSASGGGKAS 

GEQVLRVTEQ QEMPTADLSL ATXRISFIAL NNVYEGIYRL DKDNKVQPAG AAEKAEVSED 
GLTYKIKLNK DAKWSDGKPV TANDYVYGWQ RTVDPATASE YAYLYASVKN GDAIAKGEKD 
KSELGIKAVS DTELEITLEK ATPYFDYLLA FPSFFPQRQD IVEKYGKNYA SNSESAVYNG 
PFVLDGFDGP GTDTKWSFKK NDQYWDKDTV KLDSVDVNW KESPTALNLF QDGQTDDWL 
SGELAQQMAN DPAFVSQKEA STQYMELNQR DEKSPFRNAN LRKAISYSID RKALVESILR 
GW 

EF044-1 (SEQ ID NO: 161) 

TAAGATAAAA TTAGTTATAG CGTCTATAGG AGGAATAGTA TGAAAAAATT AGTTTGTGTT 
ATTTTAGTTA TTTTTTTAAC AGGTTGTAGT TCTCAAAAAG CGAATGAACC TAAAAAACAA 
GAAAATTCTA CCAATCATAC AACATCAATA AAAAGCAGTA CTAATCATTA CAGTTCTAGC 
ATAGAAACAA GCTCTAATAA TAAACTAAAA GAAACTTCAG AAAGTGCCAG CAGCACTCAA 
ACTTCGTCAA AGTCGAAAAA TGAAGTATCT ACAAATGTCG AAGAAGCAAA TTCTTTAGAA 
GCAACACCTT ATGCTGTCGA TCTTAGTAGC TTAAACAATC CACTCGTATT TAATTTTAAA 
GGAATGAATG TGCCAACTTC AATTACGTTA GAGAACTTAA ATTCAACACC AACTGCTACC 
TTCCGAACTA AATTGTTTGG GGCTGAAAAT GGTCAAGTGA AAGAAGCCAT TAATAAATAT 
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GAGCTATCTA TAAATACAAT TCCTACAAAA GAGATTAGAA TATTTTCAGC GGCCGATAAC 
AGTATTCGCA CCGTTAAAGT AAATACAGAA TTAATTTTAG GAACTAATAT TTCTTCAAAC 
GATGAACAAA ATAGATCGGG CACTTTATAC TTATTCAACA ATAAAAATGG TTCGATATCT 
TTAATCACTC CTAACTACGC TGGCAATGTT ACGGATGATC AAAAAGACGT TATGCTAGAA 
GTAATTCAAT AA 

EF044-2 {SEQ ID NO: 162) 

MKKLVCVI LVIFLTGCSS QKANEPKKQE NSTNHTTSIK SSTNHYSSSI 

ETSSNNKLKE TSESASTTQT SSKSKNEVST NVEEANSLEA TPYAVDLSSL NNPLVFNFKG 

MNVPTSITLE NLNSTPTATF RTKLFGAENG QVKEAINKYE LSINTIPTKE IRIFSAADNS 

IRTVKVNTEL ILGTNISSND EQNRSGTLYL FNNKNGSISL ITPNYAGNVT DDQKDVMLEV 

IQ 

EF044-3 {SEQ ID NO: 163) 

TTGTAGT TCTCAAAAAG CGAATGAACC TAAAAAACAA 

GAAAATTCTA CCAATCATAC AACATCAATA AAAAGCAGTA CTAATCATTA CAGTTCTAGC 
ATAGAAACAA GCTCTAATAA TAAACTAAAA GAAACTTCAG AAAGTGCCAG CACCACTCAA 
ACTTCGTCAA AGTCGAAAAA TGAAGTATCT ACAAATGTCG AAGAAGCAAA TTCTTTAGAA 
GCAACACCTT ATGCTGTCGA TCTTAGTAGC TTAAACAATC CACTCGTATT TAATTTTAAA 
GGAATGAATG TGCCAACTTC AATTACGTTA GAGAACTTAA ATTCAACACC AACTGCTACC 
TTCCGAACTA AATTGTTTGG GGCTGAAAAT GGTCAAGTGA AAGAAGCCAT TAATAAATAT 
GAGCTATCTA TAAATACAAT TCCTACAAAA GAGATTAGAA TATTTTCAGC GGCCGATAAC 
AGTATTCGCA CCGTTAAAGT AAATACAGAA TTAATTTTAG GAACTAATAT TTCTTCAAAC 
GATGAACAAA ATAGATCGGG CACTTTATAC TTATTCAACA ATAAAAATGG TTCGATATCT 
TTAATCACTC CTAACTACGC TGGCAATGTT ACGGATGATC AAAAAGACGT TATGCTAGAA 
GTAATTCAA 



EF044-4 (SEQ ID NO: 164) 

CSS QKANEPKKQE NSTNHTTSIK SSTNHYSSSI 

ETSSNNKLKE TSESASTTQT SSKSKNEVST NVEEANSLEA TPYAVDLSSL NNPLVFNFKG 
MNVPTSITLE NLNSTPTATF RTKLFGAENG QVKEAINKYE LSINTIPTKE IRIFSAADNS 
IRTVKVNTEL ILGTNISSND EQNRSGTLYL FNNKNGSISL ITPNYAGNVT DDQKDVMLEV 
IQ 

EF045-1 {SEQ ID NO: 165) 

TAGCCAAAAA ATGAGGGAGG AAAAGAGATG AACAAGAAAC GGATTTTAGG TGCAATCACG 
TTAGCTTCTG TGTTAGTATT CGGGTTAGCT GCATGTGGTG GCGGCAATAA AGGCGGGGGC 
AATAAAGCAA CGGAAACAGA AGACATTTCA AAAATGCCAA TCGCTGTTAA AAATGATAAA 
AAAGCAATTG ATGGCGGTAC ATTAGATGTC GCTGTAGTTA TGGATACACA ATTCCAAGGA 
CTTTTCCAGC AAGAATTTTA TCAAGACAAC TATGATGCAC AATACATGCT TCCAACGGTA 
CAGCCATTAT TTAACAATGA TGGAGACTTT AAGATTGTCG ATGGGGGTCC TGCGGATCTG 
AAATTAGATG AAGATGCCAA TACAGCAACC ATTAAATTAC GTGACAATTT GAAATGGTCT 
GACGGTAAAG ATGTGACAGC CGATGACGTG ATTTTCTCTT ATGAAGTCAT TGGTCATAAA 
GACTATACAG GGATTCGTTA TGATGATAAC TTTACGAATA TTGTTGGCAT GGAAGACTAC 
CATGATGGTA AATCGCCAAC CATTTCTGGC ATAGAAAAAG TCAATGATAA AGAAGTTAAA 
ATCACTTATA AAGAAGTTCA CCCAGGAATG CAACAATTAG GTGGCGGTGT TTCGGGCTCA 
GTTTTACCAA AACATGCCTT TGAAGGAATT GCTGTTAAAG ACATGGAATC AAGCGATGCA 
GTTCGTAAAA ACCCTGTGAC TATTGGACCA TACTACATGA GTAATATTGT GACAGGTGAA 
TCTGTTGAAT ACCTACCAAA TGAGCATTAC TACGGTGGTA AACCTAAATT AGATAAATTA 
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GTGTTCAAAT CTGTTCCTTC TGCGAGCATT GTAGAAGCGA TGAAAGCGAA ACAATACGAT 
ATTGCATTAT CAATGCCAAC AGATACGTAT CCAACATACA AAGATACTGA AGGGTATCAA 
ATCTTAGGAC GTCCCGAACA AGCCTACACG TATATTGGCT TTAAAATGGG TACGTTTGAC 
AAAGAAACAA ATACAGTGAA ATACAATCCA AAAGCTAAAA TGGCAGATAA AAGCTTACGT 
CAAGCCATGG GCTATGCAAT TGACAATGAT GCAGTCGGCC AAAAATTCTA CAACGGCTTA 
CGAACAGGGG CAACAACGTT AATCCCACCA GTCTTCAAGA GCTTGCATGA TAGCGAAGCG 
AAAGGCTATA CGCTTGATTT AGACAAAGCG AAAAAATTAT TAGACGATGC TGGTTATAAA 
GACGTAGACG GCGATGGCAT TCGCGAAGAC AAAGAAGGCA AACCACTAGA AATCAAGTTT 
GCTTCAATGT CAGGCGGCGA AACTGCACAA CCACTTGCTG ATTACTATGT CCAACAATGG 
AAAGAAATTG GCTTAAACGT AACGTATACA ACAGGACGCT TAATTGATTT CCAAGCATTC 
TATGATAAAT TGAAAAATGA TGACCCAGAA GTAGATATCT ATCAAGGCGC GTGGGGCACA 
GGTTCAGATC CTTCACCAAC CGGCTTATAT GGTCCAAACT CAGCCTTTAA CTATACACGT 
TTTGAGTCAG AAGAAAATAC TAAATTACTT GATGCGATTG ATTCAAAAGC ATCATTTGAT 
GAAGAAAAAC GTAAAAAAGC CTTCTACGAT TGGCAAGAGT ATGCCATTGA TGAAGCGTTT 
GTAATCCCAA CGCTTTACAG AAATGAAGTC TTGCCTGTCA ACGACCGTGT AGTTGACTTT 
ACTTGGGCAG TTGATACGAA AGATAATCCA TGGGCAACGG TGGGTGTCAC AGCAGACTCA 
CGGAAATAA 

EP045-2 (SEQ ID NO: 166) 

MN KKRILGAITL ASVLVFGLAA CGGGNKGGGN KATETEDISK MPIAVKNDKK 
AIDGGTLDVA WMDTQFQGL FQQEFYQDNY DAQYMLPTVQ PLFNNDADFK IVDGGPADLK 
LDEDANTATI KLRDNLKWSD GKDVTADDVI FSYEVIGHKD YTGIRYDDNF TNIVGMEDYH 
DGKSPTISGI EKVNDKEVKI TYKEVHPGMQ QLGGGVWGSV LPKHAFEGIA VKDMESSDAV 
RKNPVTIGPY YMSNIVTGES VEYLPNEHYY GGKPKLDKLV FKSVPSASIV EAMKAKQYDI 
ALSMPTDTYP TYKDTEGYQI LGRPEQAYTY IGFKMGTFDK ETNTVKYNPK AKMADKSLRQ 
AMGYAIDNDA VGQKFYNGLR TGATTLIPPV FKSLHDSEAK GYTLDLDKAK KLLDDAGYKD 
VDGDGIREDK EGKPLEIKFA SMSGGETAQP LADYYVQQWK EIGLNVTYTT GRLIDFQAFY 
DKLKNDDPEV DIYQGAWGTG SDPSPTGLYG PNSAFNYTRF ESEENTKLLD AIDSKASFDE 
EKRKKAFYDW QEYAIDEAFV IPTLYRNEVL PVNDRWDFT WAVDTKDNPW ATVGVTADSR 
K 

EF045-3 (SEQ ID NO: 167) 
ATGTGGTG GCGGCAATAA AGGCGGGGGC 

AATAAAGCAA CGGAAACAGA AGACATTTCA AAAATGCCAA TCGCTGTTAA AAATGATAAA 
AAAGCAATTG ATGGCGGTAC ATTAGATGTC GCTGTAGTTA TGGATACACA ATTCCAAGGA 
CTTTTCCAGC AAGAATTTTA TCAAGACAAC TATGATGCAC AATACATGCT TCCAACGGTA 
CAGCCATTAT TTAACAATGA TGCAGACTTT AAGATTGTCG ATGGGGGTCC TGCGGATCTG 
AAATTAGATG AAGATGCCAA TACAGCAACC ATTAAATTAC GTGACAATTT GAAATGGTCT 
GACGGTAAAG ATGTGACAGC CGATGACGTG ATTTTCTCTT ATGAAGTCAT TGGTCATAAA 
GACTATACAG GGATTCGTTA TGATGATAAC TTTACGAATA TTGTTGGCAT GGAAGACTAC 
CATGATGGTA AATCGCCAAC CATTTCTGGC ATAGAAAAAG TCAATGATAA AGAAGTTAAA 
ATCACTTATA AAGAAGTTCA CCCAGGAATG CAACAATTAG GTGGCGGTGT TTGGGGCTCA 
GTTTTACCAA AACATGCCTT TGAAGGAATT GCTGTTAAAG ACATGGAATC AAGCGATGCA 
GTTCGTAAAA ACCCTGTGAC TATTGGACCA TACTACATGA GTAATATTGT -GACAGGTGAA 
TCTGTTGAAT ACCTACCAAA TGAGCATTAC TACGGTGGTA AACCTAAATT AGATAAATTA 
GTGTTCAAAT CTGTTCCTTC TGCGAGCATT GTAGAAGCGA TGAAAGCGAA ACAATACGAT 
ATTGCATTAT CAATGCCAAC AGATACGTAT CCAACATACA AAGATACTGA AGGGTATCAA 
ATCTTAGGAC GTCCCGAACA AGCCTACACG TATATTGGCT TTAAAATGGG TACGTTTGAC 
AAAGAAACAA ATACAGTGAA ATACAATCCA AAAGCTAAAA TGGCAGATAA AAGCTTACGT 
CAAGCCATGG GCTATGCAAT TGACAATGAT GCAGTCGGCC AAAAATTCTA CAACGGCTTA 
CGAACAGGGG CAACAACGTT AATCCCACCA GTCTTCAAGA GCTTGCATGA TAGCGAAGCG 
AAAGGCTATA CGCTTGATTT AGACAAAGCG AAAAAATTAT TAGACGATGC TGGTTATAAA 
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GACGTAGACG GCGATGGCAT TCGCGAAGAC AAAGAAGGCA AACCACTAGA AATCAAGTTT 
GCTTCAATGT CAGGCGGCGA AACTGCACAA CCACTTGCTG ATTACTATGT CCAACAATGG 
AAAGAAATTG GCTTAAACGT AACGTATACA ACAGGACGCT TAATTGATTT CCAAGCATTC 
TATGATAAAT TGAAAAATGA TGACCCAGAA GTAGATATCT ATCAAGGCGC GTGGGGCACA 
GGTTCAGATC CTTCACCAAC CGGCTTATAT GGTCCAAACT CAGCCTTTAA CTATACACGT 
TTTGAGTCAG AAGAAAATAC TAAATTACTT GATGGGATTG ATTCAAAAGC ATCATTTGAT 
GAAGAAAAAC GTAAAAAAGC CTTCTACGAT TGGCAAGAGT ATGCCATTGA TGAAGCGTTT 
GTAATCCCAA CGCTTTACAG AAATGAAGTC TTGCCTGTCA ACGACCGTGT AGTTGACTTT 
ACTTGGGCAG TTGATACGAA AGATAATCCA TGGGCAACGG TGGGTGTCAC AGCAGACTCA 
CGGAAA 



EF045-4 (SEQ ID NO: 168) 

CGGGNKGGGN KATETEDISK MPIAVKNDKK 
AIDGGTLDVA WMDTQFQGL FQQEFYQDNY 
LDEDANTATI KLRDNLKWSD GKDVTADDVI 
DGKSPTISGI EKVNDKEVKI TYKEVHPGMQ 
RKNPVTIGPY YMSNIVTGES VEYLPNEHYY 
ALSMPTDTYP TYKDTEGYQI LGRPEQAYTY 
AMGYAIDNDA VGQKFYNGLR TGATTLIPPV 
VDGDGIREDK EGKPLEIKFA SMSGGETAQP 
DKLKNDDPEV DIYQGAWGTG SDPSPTGLYG 
EKRKKAFYDW QEYAIDEAFV IPTLYRNEVL 
K 



DAQYMLPTVQ PLFNNDADFK IVDGGPADLK 
FSYEVIGHKD YTGIRYDDNF TNIVGMEDYH 
QLGGGVWGSV LPKHAFEGIA VKDMESSDAV 
GGKPKLDKLV FKSVPSASIV EAMKAKQYDI 
IGFKMGTFDK ETNTVKYNPK AKMADKSLRQ 
FKSIiHDSEAK GYTLDLDKAK KLLDDAGYKD 
LADYYVQQWK EIGLNVTYTT GRLIDFQAFY 
PNSAFNYTRF ESEENTKLLD AIDSKASFDE 
PVNDRWDFT WAVDTKDNPW ATVGVTADSR 



EF046-1 (SEQ ID NO: 169) 

TAGGAGGATA TAATGAAAAA AAAACTTATT 
TGTAGTAATA ATACTGGGGG AAAAAATAGC 
CAGCAAACTA CCCAGTCTTC TAAAAAAGAT 
ACATCATCTA TAACAATTGA AACAACCGAG 
GATGATGTTT CAAAAACTAG ACGACAATTG 
ACGGATAAAG AACTAAAGGA ATATATATCA 
AATTATATTA AGCAAAAA 



GTACTATTGT TAGCCTTATT TTTAACGGCA 
GACGCTTCAT CTACTGAAGT ATCAACTAAG 
AGTAGTAATC CGGACACAAC ACCAACTTCT 
AATTTAAAGA ATAGAGAATT GAATCCAACA 
TATGAACAAG GAATTAACAG TTCAACAATT 
GAGGCTAAAG AACAAAAGAA AGATGTCATT 



EF046-2 (SEQ ID NO: 170) 

MKKKLIV LLLALFLTAC SNNTGGKNSD ASSTEVSTKQ QTTQSSKKDS SNPDTTPTST 
SSITIETTEN LKNRELNPTD DVSKTRRQLY EQGINSSTIT DKELKEYISE AKEQKKDVIN 
YIKQK 



EF046-3 (SEQ ID NO: 171) 
A 

TGTAGTAATA ATACTGGGGG AAAAAATAGC 
CAGCAAACTA CCCAGTCTTC TAAAAAAGAT 
ACATCATCTA TAACAATTGA AACAACCGAG 
GATGATGTTT CAAAAACTAG ACGACAATTG 
ACGGATAAAG AACTAAAGGA ATATATATCA 
AATTATATTA AGCAAAAA 



GACGCTTCAT CTACTGAAGT ATCAACTAAG 
AGTAGTAATC CGGACACAAC ACCAACTTCT 
AATTTAAAGA ATAGAGAATT GAATCCAACA 
TATGAACAAG GAATTAACAG TTCAACAATT 
GAGGCTAAAG AACAAAAGAA AGATGTCATT 
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EF046-4 (SEQ ID NO: 172) 

C SNNTGGKNSD ASSTEVSTKQ QTTQSSKKDS SNPDTTPTST 

SSITIETTEN LKNRELNPTD DVSKTRRQLY EQGINSSTIT DKELKEYISE AKEQKKDVIN 
YIKQK 

EF047-1 (SEQ ID NO: 173) 

TAGGGAAAAC AAGGAGGAAT TCTTATGAAA AAGATAGGGC TTATTTCTAG TGCTTTTCTT 
TTAACCCTTG CTTTAGCAGC ATGCGGCGGC GGAAAAAGTA CAGAAAATAC GGATAGTCGT 
TCCAGTGCTG CGGAAAGTAC CACAGTCGAG AGTACAAAAG CATCTGCTAC AAAAGAATCA 
AGTAGCAAAG CAACAACAAA ATCTAGTGAT GCGAAACCGT CAGGAACAAC AACAGCTGAT 
TCGAAAGCAA CAGCTTCTTC TACGAAGGAA GCGGCAAATA ATGGCTCAGC AGAGAAGCAA 
TCACCAGCGA AAAATGCGAA TCCAGATGAC CAAGCCAACC AAGTGCTTAA CCAGCTAGCA 
AACATGTTTC CTGGTCAAGG CTTACCGCAG GCAATTTTAA CGAGTCAAAC GAATAACTTT 
TTAACTGCAG CGACAACTTC ACAAGCGGAT CAAAACAATT TCCGTGTTTT ATATTATGCA 
GAAAAAGAAG CGATTCCAGT GAATGATGCA CGTGTCAATC AGTTAACGCC AATTAGTTCT 
TTTGAGAAAA AAACATATGG CTCTGATGCC GAAGCAAAAA ATGCAGTGAA CCAAATCATT 
GACAATGGCG GTCAACCAGT AGATTTAGGT TACAATATTA CTGGGTATAA ACAAGGGGCG 
GCAGGTTCTA GTTACTTATC TTGGCAAGAA GGCAATTGGA GTTTAGTCGT ACGGGCCTCA 
AATATCAATG GTGAATCGCC TGATGATTTA GCGAAAAATG TTGTCAACAT TTTGGAACAA 
GAAACATTAC CAGCACCGAA TACCGTTGGT CAAATCACAC TGAACGTGGC AGGAACCACT 
GACTATAATC GAAACTCAGT AGTTTGGCAA GCCGGTACAG TCGTTTACTC TGTCCATCAT 
TTTGACCCAA TTCAAGCAGT GAAGATGGCA ACATCAATGT AA 

EF047-2 (SEQ ID NO: 174) 

MKK IGLISSAFLL TLALAACGGG KSTENTDSRS SAAESTTVES TKASATKESS 
SKATTKSSDA KPSGTTTADS KATASSTKEA ANNGSAEKQS PAKNANPDDQ ANQVLNQLAN 
MFPGQGLPQA ILTSQTNNFL TAATTSQADQ NNFRVLYYAE KEAIPVNDAR WQLTPISSF 
EKKTYGSDAE AKNAVNQIID NGGQPVDLGY NITGYKQGAA GSSYLSWQEG NWSLWRASN 
INGESPDDLA KNWNILEQE TLPAPNTVGQ ITLNVAGTTD YNRNSWWQA GTWYSVHHF 
DPIQAVKMAT SM 

EF047-3 (SEQ ID NO: 175) 

ATGCGGCGGC GGAAAAAGTA CAGAAAATAC GGATAGTCGT 

TCCAGTGCTG CGGAAAGTAC CACAGTCGAG AGTACAAAAG CATCTGCTAC AAAAGAATCA 
AGTAGCAAAG CAACAACAAA ATCTAGTGAT GCGAAACCGT CAGGAACAAC AACAGCTGAT 
TCGAAAGCAA CAGCTTCTTC TACGAAGGAA GCGGCAAATA ATGGCTCAGC AGAGAAGCAA 
TCACCAGCGA AAAATGCGAA TCCAGATGAC CAAGCCAACC AAGTGCTTAA CCAGCTAGCA 
AACATGTTTC CTGGTCAAGG CTTACCGCAG GCAATTTTAA CGAGTCAAAC GAATAACTTT 
TTAACTGCAG CGACAACTTC ACAAGCGGAT CAAAACAATT TCCGTGTTTT ATATTATGCA 
GAAAAAGAAG CGATTCCAGT GAATGATGCA CGTGTCAATC AGTTAACGCC AATTAGTTCT 
TTTGAGAAAA AAACATATGG CTCTGATGCC GAAGCAAAAA ATGCAGTGAA CCAAATCATT 
GACAATGGCG GTCAACCAGT AGATTTAGGT TACAATATTA CTGGGTATAA ACAAGGGGCG 
GCAGGTTCTA GTTACTTATC TTGGCAAGAA GGCAATTGGA GTTTAGTCGT ACGGGCCTCA 
AATATCAATG GTGAATCGCC TGATGATTTA GCGAAAAATG TTGTCAACAT TTTGGAACAA 
GAAACATTAC CAGCACCGAA TACCGTTGGT CAAATCACAC TGAACGTGGC AGGAACCACT 
GACTATAATC GAAACTCAGT AGTTTGGCAA GCCGGTACAG TCGTTTACTC TGTCCATCAT 
TTTGACCCAA TTCAAGCAGT GAAGATGGCA ACATCAATGT AA 

EP047-4 (SEQ ID NO: 176) 
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CGGG KSTENTDSRS SAAESTTVES TKASATKESS 

SKATTKSSDA KPSGTTTADS KATASSTKEA ANNGSAEKQS PAKNANPDDQ ANQVLNQLAN 
MFPGQGLPQA ILTSQTNNFL TAATTSQADQ NNFRVLYYAE KEAIPVNDAR VNQLTPISSF 
EKKTYGSDAE AKNAVNQIID NGGQPVDLGY NITGYKQGAA GSSYLSWQEG NWSLWRASN 
INGESPDDIiA KNWNILEQE TLPAPNTVGQ ITLNVAGTTD YNRNSWWQA GTWYSVHHF 
DPIQAVKMAT SM 



EF048-1 {SEQ ID NO:177) 

TAAGGAGAAA AGTTCATGAA AAAAAGAAAG GTTTTATTTA CAGCAGTTAT GGTATTGGCA 
GGATTACAGT TCCTAAGTGG TTGCGGCAAA ACAGAAGCTT CGGCAAATGA TAGGGTACTC 
TTGCGCTATG CGTATGCTAG TAATAGCCAA CCAGTTATCG ATTCTATGAA GAAATTCGGT 
GAATTAGTAG AGGAAAAAAC AGATGGTAAA GTTCAAATTG AATATTTTCC AGATGGTCAA 
TTAGGAGGAG AAACAGAACT AATTGAATTA ACACAAACAG GTGCAATTGA TTTTGCAAAG 
GTCAGTGGAT CAGCATTAGA AAGTTTTTCT AAAGATTATT CTGTATTTGC CATTCCGTAT 
ATTTTTGATA ATGAAAAACA TTTTTTTAAA GTAATGGATA ATCAAGCGCT AATGCAACCA 
GTGTATGATT CTACAAAAAA ATTAGGATTT GTTGGTTTAA CTTATTATGA CTCTGGTCAA 
CGAAGTTTTT ATATGAGCAA AGGGCCTGTT ACATCTCCAG ATGATTTGAA AGGTAAAAAA 
ATTCGGGTCA TGCAAAGTGA AACCGCCATC AAAATGGTAG AACTTTTAGG GGGTTCGCCA 
GTACCTATGG GTAGTTCGGA AGTATATACT TCTCTACAAT CTAATCTAAT CAACGGTGCA 
GAGAATAATG AGTTCGTTTT ATATACAGCT GGTCATGGTG GTGTGGCTAA GTATTATTCT 
TATGATGAGC ATACTCGAGT GCCAGATATT GTGATTATGA ACGAGGGAAC AAAAGAACGT 
TTGACAGCGA AACAAGAACA AGCGATTGAA GAAGCAGCAA AAGAATCGAC CGCTTTTGAA 
AAAACGGTCT TTAAAGAAGC GGTTGAAGAA GAAAAGAAAA AAGCACAAGC AGAATATGGC 
GTTGTGTTCA ATCAAGTAGA CAGTGAACCA TTCCAAAAAC TTGTTCAACC GTTGCATGAA 
TCATTCAAAA ATAGCTCAGA ACATGGCGAA CTGTATCAGG CTATTCGCCA GTTGGCGGAC 
TAA 

EF048-2 (SEQ ID NO:178) 

MKKRKV LFTAVMVLAG LQLLSGCGKT EASANDTWL RYAYASNSQP VIDSMKKFGE 
LVEEKTDGKV QIEYFPDGQL GGETELIELT QTGAIDFAKV SGSALESFSK DYSVFAIPYI 
FDNEKHFFKV MDNQALMQPV YDSTKKLGFV GLTYYDSGQR SFYMSKGPVT SPDDLKGKKI 
RVMQSETAIK MVELLGGSPV PMGSSEVYTS LQSNLINGAE NNEFVLYTAG HGGVAKYYSY 
DEHTRVPDIV IMNEGTKERL TAKQEQAIEE AAKESTAFEK TVFKEAVEEE KKKAQAEYGV 
VFNQVDSEPF QKLVQPLHES FKNSSEHGEL YQAIRQLAD 

EF048-3 (SEQ ID NO: 179) 

TTGCGGCAAA ACAGAAGCTT CGGCAAATGA TACGGTAGTC 

TTGCGCTATG CGTATGCTAG TAATAGCCAA CCAGTTATCG ATTCTATGAA GAAATTCGGT 
GAATTAGTAG AGGAAAAAAC AGATGGTAAA GTTCAAATTG AATATTTTCC AGATGGTCAA 
TTAGGAGGAG AAACAGAACT AATTGAATTA ACACAAACAG GTGCAATTGA TTTTGCAAAG 
GTCAGTGGAT CAGCATTAGA AAGTTTTTCT AAAGATTATT CTGTATTTGC CATTCCGTAT 
ATTTTTGATA ATGAAAAACA TTTTTTTAAA GTAATGGATA ATCAAGCGCT AATGCAACCA 
GTGTATGATT CTACAAAAAA ATTAGGATTT GTTGGTTTAA CTTATTATGA CTCTGGTCAA 
CGAAGTTTTT ATATGAGCAA AGGGCCTGTT ACATCTCCAG ATGATTTGAA AGGTAAAAAA 
ATTCGGGTCA TGCAAAGTGA AACCGCCATC AAAATGGTAG AACTTTTAGG GGGTTCGCCA 
GTACCTATGG GTAGTTCGGA AGTATATACT TCTCTACAAT CTAATCTAAT CAACGGTGCA 
GAGAATAATG AGTTCGTTTT ATATACAGCT GGTCATGGTG GTGTGGCTAA GTATTATTCT 
TATGATGAGC ATACTCGAGT GCCAGATATT GTGATTATGA ACGAGGGAAC AAAAGAACGT 
TTGACAGCGA AACAAGAACA AGCGATTGAA GAAGCAGCAA AAGAATCGAC CGCTTTTGAA 
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AAAACGGTCT TTAAAGAAGC GGTTGAAGAA GAAAAGAAAA AAGCACAAGC AGAATATGGC 
GTTGTGTTCA ATCAAGTAGA CAGTGAACCA TTCCAAAAAC TTGTTCAACC GTTGCATGAA 
TCATTCAAAA ATAGCTCAGA ACATGGCGAA CTGTATCAGG CTATTCGCCA GTTGGCGGAC 
TAA 

EF048-4 (SEQ ID NO:180) 

CGKT EASANDTWL RYAYASNSQP VIDSMKKFGE 

LVEEKTDGKV QIEYFPDGQL^ GGETELIELT QTGAIDFAKV SGSALESFSK DYSVFAIPYI 
FDNEKHFFKV MDNQALMQPV YDSTKKLGFV GLTYYDSGQR SFYMSKGPVT SPDDLKGKKI 
RVMQSETAIK MVELLGGSPV PMGSSEVYTS LQSNLINGAE NNEFVLYTAG HGGVAKYYSY 
DEHTRVPDIV IMNEGTKERL TAKQEQAIEE AAKESTAFEK TVFKEAVEEE KKKAQAEYGV 
VFNQVE3SEPF QKLVQPLHES FKNSSEHGEL YQAIRQLAD 



EF049-1 (SEQ ID NO: 181) 

TGAGACTCTT TCTTTTTCAA AATGAGGTAT GGTATAGTTA TAACAGANAT AAAACTANAA 
AAAACAGGAG TGCATAAGAG AATGAAGAAA AAACTAATCT TAGCTGCAGC GGGCGCAATG 
GCCGTTTTTA GTTTAGCAGC GTGTTCAAGC GGTTCAAAAG ATATCGCAAC AATGAAAGGT 
TCAACAATTA CTGTTGATGA TTTTTATAAC CAAATTAAAG AACAAAGCAC TAGCCAACAA 
GCGTTTAGCC AAATGGTTAT TTATAAAGTC TTTGAAGAAA AATATCGCGA CAAAGTAACT 
GACAAAGANA TTCAAAAAAA CTTTGACGAA GCCAAAGAAC AAGTAGAAGC ACAAGGCGGA 
AAGTTCTCTG ATGCATTAAA ACAAGCTGGT TTAACTGAAA AAACATTCAA GAAACAGTTA 
AAACAAAGAG CAGCCTATGA TGCAGGTCTA AAAGCCCACT TAAAAATTAC AGATGAAGAC 
TTAAAAACAG CTTGGGCAAG TTTCCATCCA GAAGTAGAAG CACAAATTAT CCAAGTTGCT 
TCAGAAGATG ATGCCAAAGC TGTCAAGAAA GAAATCACTG AGGGCGGCGA TTTCACAAAA 
ATTGCTAAAG AAAAATCAAC AGATACTGCT ACGAAAAAAG ATGGCGGTAA AATTAAATTT 
GATTCACAAG CAACAACTGT TCCTGCCGAA GTTAAAGAAG CTGCCTTCAA ATTAAAAGAT 
GGCGAAGTGT CAGAACCAAT TGCTGCAACA AATATGCAAA CCTACCAAAC AACCTACTAT 
GTAGTGAAAA TGACGAAAAA CAAAGCAAAA GGCAATGACA TGAAACCTTA TGAAAAAGAG 
ATCAAGAAAA TTGCTGAAGA AACAAAATTA GCCGATCAAA CATTTGTTTC GAAAGTCATT 
AGTGACGAAT TAAAAGCGGC CAATGTGAAA ATTAAAGATG ATGCCTTCAA GAACGCTTTA 
GCAGGCTACA TGCAAACTGA ATCTTCAAGC GCTTCTTCAG AGAAAAAAGA ATCAAAATCA 
AGTGATTCTA AAACAAGCGA TACCAAAACA AGCGACTCTG AAAAAGCAAC AGATTCTTCA 
AGCAAAACAA CAGAATCTTC TTCTAAATAA 

EF049-2 (SEQ ID NO:182) 

MKKK LILAAAGAMA VFSLAACSSG SKDIATMKGS 

TITVDDFYNQ IKEQSTSQQA FSQMVIYKVF EEKYGDKVTD KXIQKNFDEA KEQVEAQGGK 
FSDALKQAGL TEKTFKKQLK QRAAYDAGLK AHLKITDEDL KTAWASFHPE VEAQIIQVAS 
EDDAKAVKKE ITDGGDFTKI AKEKSTDTAT KKDGGKIKFD SQATTVPAEV KEAAFKLKDG 
EVSEPIAATN MQTYQTTVYV VKMTKNKAKG NDMKPYEKEI KKIAEETKLA DQTFVSKVIS 
DELKAANVKI KDDAFKNALA GYMQTESSSA SSEKKESKSS DSKTSDTKTS DSEKATDSSS 
KTTESSSK 

EF049-3 (SEQ ID NO;183) 

GTGTTCAAGC GGTTCAAAAG ATATCGCAAC AATGAAAGGT 

TCAACAATTA CTGTTGATGA TTTTTATAAC CAAATTAAAG AACAAAGCAC TAGCCAACAA 
GCGTTTAGCC AAATGGTTAT TTATAAAGTC TTTGAAGAAA AATATGGCGA CAAAGTAACT 
GACAAAGANA TTCAAAAAAA CTTTGACGAA GCCAAAGAAC AAGTAGAAGC ACAAGGCGGA 
AAGTTCTCTG ATGCATTAAA ACAAGCTGGT TTAACTGAAA AAACATTCAA GAAACAGTTA 
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AAACAAAGAG CAGCCTATGA TGCAGGTCTA AAAGCCCACT TAAAAATTAC AGATGAAGAC 
TTAAAAACAG CTTGGGCAAG TTTCCATCCA GAAGTAGAAG CACAAATTAT CCAAGTTGCT 
TCAGAAGATG ATGCCAAAGC TGTCAAGAAA GAAATCACTG ACGGCGGCGA TTTCACAAAA 
ATTGCTAAAG AAAAATCAAC AGATACTGCT ACGAAAAAAG ATGGCGGTAA AATTAAATTT 
GATTCACAAG CAACAACTGT TCCTGCCGAA GTTAAAGAAG CTGCCTTCAA ATTAAAAGAT 
GGCGAAGTGT CAGAACCAAT TGCTGCAACA AATATGCAAA CCTACCAAAC AACCTACTAT 
GTAGTGAAAA TGACGAAAAA CAAAGCAAAA GGCAATGACA TGAAACCTTA TGAAAAAGAG 
ATCAAGAAAA TTGCTGAAGA AACAAAATTA GCCGATCAAA CATTTGTTTC GAAAGTCATT 
AGTGACGAAT TAAAAGCGGC CAATGTGAAA ATTAAAGATG ATGCCTTCAA GAACGCTTTA 
GCAGGCTACA TGCAAACTGA ATCTTCAAGC GCTTCTTCAG AGAAAAAAGA ATCAAAATCA 
AGTGATTCTA AAACAAGCGA TACCAAAACA AGCGACTCTG AAAAAGCAAC AGATTCTTCA 
AGCAAAACAA CAGAATCTTC TTCTAAATAA 



EF049-4 (SEQ ID NO: 184) 
CSSG SKDIATMKGS 

TITVDDFYNQ IKEQSTSQQA FSQMVIYKVF 
FSDALKQAGL TEKTFKKQLK QRAAYDAGLK 
EDDAKAVKKE ITDGGDFTKI AKEKSTDTAT 
EVSEPIAATN MQTYQTTYYV VKMTKNKAKG 
DELKAANVKI KDDAFKNALA GYMQTESSSA 
KTTESSSK 



EEKYGDKVTD KXIQKNFDEA KEQVEAQGGK 
AHLKITDEDL KTAWASFHPE VEAQIIQVAS 
KKDGGKIKFD SQATTVPAEV KEAAFKLKDG 
NDMKPYEKEI KKIAEETKLA DQTFVSKVIS 
SSEKKESKSS DSKTSDTKTS DSEKATDSSS 



EF050-1 (SEQ ID NO: 185) 

TAGGGTCTGG AAAAGCAGTC AACTGACTTC TTTTCCAAGC CCTTTTTTAG TTCATCGCAG 
AAAGGATGNA AAAAAATGAA CATGCCCAAA AATATCNGTT ATTTTTCTTT GCTAATGGGT 
CTTG1TCTAT TATTAAGTGC TTGCCAAATT GGGGCAACTA CGAAGGATGA CAACCAAGCC 
GCCACAAAAG AAGCAACTGT TGAGTTAAAC CGCACAACAA CACCAACGCT TTTTTTTCAT 
GGTTACGCAG GAACTAAAAA TTCGTTTGGC TCGTTACTGC ATCGCTTGGA GAAACAAGGT 
GCCACAACTC AAGAATTAGT GCTACTCGTT AAACCTGATG GGACCGTGGT TAAAGAGCGA 
GGAGCTTTAA GTGGCAAAGC GACGAATCCC AGTGTTCAAG TTCTATTTGA AGATAATAAA 
AACAATGAAT GGAATCAAAC AGAATGGATA AAAAACACAT TACTCTATTT ACAAAAAAAT 
TATCAAGTGA ACAAAGCCAA TATTGTCGGG CACTCTATGG GTGGTGTTAG TGGTTTACGT 
TATTTAGGAA CCTATGGGCA AGATACATCG TTACCTAAAA TTGAAAAATT CGTCAGCATT 
GGAGCACCTT TCAATGATTT TATTGATACG AGTCAACAGC AAACCATCGA AACGGAACTA 
GAAAACGGCC CCACAGAAAA AAGTAGCCGC TATTTGGATT ATCAAGAGAT GATTAATGTT 
GTTCCAGAAA AACTGCCCAT TTTATTAATT GGTGGTCAAT TAAGTCCAAC AGATTTAAGT 
GATGGAACGG TGCCGTTATC TAGTGCCTTA GCAGTCAACG CCTTGCTAAG ACAGCGAGGA 
ACTCAAGTCA CTAGCCAGAT TATTAAAGGA GAAAATGCAC AACATAGTCA ATTACATGAA 
AATCCTGAAG TAGATCAATT GCTAATCGAA TTTCTATGGC CGAGTAAAAA ATAG 

EF050-2 (SEQ ID NO: 186) 

MNMPKN IXYFSLLMGL VLLLSACQIG ATTKDDNQAA 

TKEATVELNR TTTPTLFFHG YAGTKNSFGS LLHRLEKQGA TTQELVLLVK PDGTWKERG 
ALSGKATNPS VQVLFEDNKN NEWNQTEWIK NTLLYLQKNY QVNKANIVGH SMGGVSGLRY 
LGTYGQDTSL PKIEKFVSIG APFNDFIDTS QQQTIETELE NGPTEKSSRY LDYQEMINW 
PEKLPILLIG GQLSPTDLSD GTVPLSSALA VNALLRQRGT QVTSQIIKGE NAQHSQLHEN 
PEVDQLLIEF LWPSKK 



EF050-3 (SEQ ID NO: 187) 
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TTGCCAAATT GGGGCAACTA CGAAGGATGA 
GCCACAAAAG AAGCAACTGT TGAGTTAAAC 
GGTTACGCAG GAACTAAAAA TTCGTTTGGC 
GCCACAACTC AAGAATTAGT GCTACTCGTT 
GGAGCTTTAA GTGGCAAAGC GACGAATCCC 
AACAATGAAT GGAATCAAAC AGAATGGATA 
TATCAAGTGA ACAAAGCCAA TATTGTCGGG 
TATTTAGGAA CCTATGGGCA AGATACATCG 
GGAGCACCTT TCAATGATTT TATTGATACG 
GAAAACGGCC CCACAGAAAA AAGTAGCCGC 
GTTCCAGAAA AACTGCCCAT TTTATTAATT 
GATGGAACGG TGCCGTTATC TAGTGCCTTA 
ACTCAAGTCA CTAGCCAGAT TATTAAAGGA 
AATCCTGAAG TAGATCAATT GCTAATCGAA 

EF050-4 (SEQ ID NO: 188) 



CAACCAAGCC 

CGCACAACAA CACCAACGCT TTTTTTTCAT 
TCGTTACTGC ATCGCTTGGA GAAACAAGGT 
AAACCTGATG GGACCGTGGT TAAAGAGCGA 
AGTGTTCAAG TTCTATTTGA AGATAATAAA 
AAAAACACAT TACTCTATTT ACAAAAAAAT 
CACTCTATGG GTGGTGTTAG TGGTTTACGT 
TTACCTAAAA TTGAAAAATT CGTCAGCATT 
AGTCAACAGC AAACCATCGA AACGGAACTA 
TATTTGGATT ATCAAGAGAT GATTAATGTT 
GGTGGTCAAT TAAGTCCAAC AGATTTAAGT 
GCAGTCAACG CCTTGCTAAG ACAGCGAGGA 
GAAAATGCAC AACATAGTCA ATTACATGAA 
TTTCTATGGC CGAGTAAAAA ATAG 



CQIG ATTKDDNQAA 

TKEATVELNR TTTPTLFFHG YAGTKNSFGS 
ALSGKATNPS VQVLFEDNKN NEWNQTEWIK 
LGTYGQDTSL PKIEKFVSIG APFNDFIDTS 
PEKLPILLIG GQLSPTDLSD GTVPLSSALA 
PEVDQLLIEF LWPSKK 



LLHRLEKQGA TTQELVLLVK PDGTWKERG 
NTLLYLQKNY QVNKANIVGH SMGGVSGLRY 
QQQTIETELE NGPTEKSSRY LDYQEMINW 
VNALLRQRGT QVTSQIIKGE NAQHSQLHEN 



EF051-1 (SEQ ID NO:189) 

TAAAAGAAAA GAGGCGTTCA AATGTCTAAA CAAAAAAAGG CTGTGTTCCT GCTTAGTTTA 
TTCAGTTTAG TTGCCCTAAT TGCTGCATGT ACAAATCAGC CGCAAAAAGA AACAGTTTCA 
ACAAAAAAAG AAGAAATAAC CCTTGCGGCA GCAGCTAGCT TAGAATCAGT CATGGAGAAG 
AAAATTATTC CAGCCTTTGA AAAAGAGCAT CCAGATATTC AGGTAACTGG AACCTATGAT 
AGTTCTGGAA AATTACAGAT GCAAATTGAA AAAGGCCTAA AAGCCGATGT ATTTTTCTCA 
GCTTCGACAA AACAAATGAA TGCATTGGTT GCAGAAAAAC TAATTAATAA AAAAAGTGTC 
GTTCCTTTAT TGGAAAACCA GCTCGTTCTT ATTGTGCCTA ACCAAGATCA AGCAAAGTGG 
CATGATTTTT CTGATTTATU^ AAAAGCCCAA ATGATAGCAA TTGGTGATCC TGCAAGTGTT 
CCAGCTGGTC AATATGCCGA AGAAGGCTTA AAAGCTTTAG GCGCTTGGTC TTATGTAGAA 
AAACACGCAA GCTTTGGCAC GAATGTAACA GAAGTCCTTG AATGGGTAGC TAATGCAAGT 
GCAGAAGCTG GCTTAGTTTA TGCGACAGAT GCAGCAACCA ATTCAAAAGT AGCGATTGTT 
GCGGCCATGC CTGAAGCTGT TTTGAAAAAG CCAATTATCT ATCCAGTTGG TAAAGTTGCC 
GCCTCTAAGA AACAAAAATC AGCAGATGCT TTTTTAAATT TTTTACAGAG TCAACAATGC 
AGAAAATATT TTGANAATAT TGGCTTTAAG TTAACAAAGT AG 

EF051-2 (SEQ ID NO: 190) 

MSKQ KKAVFLLSLF SLVALIAACT NQPQKETVST KKEEITLAAA ASLESVMEKK 
IIPAFEKEHP DIQVTGTYDS SGKLQMQIEK GLKADVFFSA STKQMNALVA EKLINKKSW 
PLLENQLVLI VPNQDQAKWH DFSDLKKAQM lAIGDPASVP AGQYAEEGLK ALGAWSYVEK 
HASFGTNVTE VLEWVANASA EAGLVYATDA ATNSKVAIVA AMPEAVLKKP IIYPVGKVAA 
SKKQKSADAF LNFLQSQQCR KYFXNIGFKL TK 

EF051-3 (SEQ ID NO: 191) 

ATGT ACAAATCAGC CGCAAAAAGA AACAGTTTCA 

ACAAAAAAAG AAGAAATAAC CCTTGCGGCA GCAGCTAGCT TAGAATCAGT CATGGAGAAG 
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AAAATTATTC CAGCCTTTGA AAAAGAGCAT CCAGATATTC AGGTAACTGG AACCTATGAT 
AGTTCTGGAA AATTACAGAT GCAAATTGAA AAAGGCCTAA AAGCCGATGT ATTTTTCTCA 
GCTTCGACAA AACAAATGAA TGCATTGGTT GCAGAAAAAC TAATTAATAA AAAAAGTGTC 
GTTCCTTTAT TGGAAAACCA GCTCGTTCTT ATTGTGCCTA ACCAAGATCA AGCAAAGTGG 
CATGATTTTT CTGATTTAAA AAAAGCCCAA ATGATAGCAA TTGGTGATCC TGCAAGTGTT 
CCAGCTGGTC AATATGCCGA AGAAGGCTTA AAAGCTTTAG GCGCTTGGTC TTATGTAGAA 
AAACACGCAA GCTTTGGCAC GAATGTAACA GAAGTCCTTG AATGGGTAGC TAATGCAAGT 
GCAGAAGCTG GCTTAGTTTA TGCGACAGAT GCAGCAACCA ATTCAAAAGT AGCGATTGTT 
GCGGCCATGC CTGAAGCTGT TTTGAAAAAG CCAATTATCT ATCCAGTTGG TAAAGTTGCC 
GCCTCTAAGA AACAAAAATC AGCAGATGCT TTTTTAAATT TTTTACAGAG TCAACAATGC 
AGAAAATATT TTGANAATAT TGGCTTTAAG TTAACAAAGT AG 

EF051-4 (SEQ ID NO: 192) 

CT NQPQKETVST KKEEITLAAA ASLESVMEKK 

IIPAFEKEHP DIQVTGTYDS SGKLQMQIEK GLKADVFFSA STKQMNALVA EKLINKKSW 
PLLENQLVLI VPNQDQAKWH DFSDLKKAQM lAIGDPASVP AGQYAEEGLK ALGAWSYVEK 
HASFGTNVTE VLEWVANASA EAGLVYATDA ATNSKVAIVA AMPEAVLKKP IIYPVGKVAA 
SKKQKSADAF LNFLQSQQCR KYFXNIGFKL TK 



EF052-1 (SEQ ID NO:193) 

TAAAGTAGGA GAAGCGCAAG CGAAAAAAGT 
CCCACAATGG GTACCATGGG TAGCATTATC 
TTACTTAGTT CGTCGTGGAG AGAAGTGGAA 
NGAAATCTTC NGTTTTTATT ATTGTTGGTT 
GCAGAAAATA GGGAGACCAC AGAAGTCGGA 
TCAAAAAAAA ATCCAGTTGT GAATGTATTG 
GTTCGTAGCA GAACGCAAAT AAAAAGATTA 
CTAAGCTGGT TTGGCATATT GTTTTTAATA 
TTATGTAGAA AAGGAGAATA A 



GAATCAATCG OCAGCGTATC AAGTAGTGAT 
TTTGACAGTA GCACTTGCTG GATTGATTGC 
AAACGAAGGG GAAGTGACAT AATGAGANGA 
CTATTAATTT ATATTCCTCA AACAACTTAT 
ATCGGGTTTA CAAAAACTTC AGACATACCA 
CCGCAAACAA CCATTCAATC GCTATCAATC 
CCTAAAACTG GTGACAATCG AATAACTTGG 
AGTAGTTTTT GGCTGTTTCT ATTTAGACAA 



EF052-2 (SEQ ID NO: 194) 
MRXX 

NLXFLLLLVL LIYIPQTTYA ENRETTEVGI GFTKTSDIPS KKNPWNVLP QTTIQSLSIV 
RSRTQIKRLP KTGDNRITWL SWFGILFLIS SFWLFLFRQL CRKGE 



EF052-3 (SEQ ID NO: 195) 

AGAAAATA GGGAGACCAC AGAAGTCGGA ATCGGGTTTA CAAAAACTTC AGACATACCA 
TCAAAAAAAA ATCCAGTTGT GAATGTATTG CCGCAAACAA CCATTCAATC GCTATCAATC 
GTTCGTAGCA GAACGCAAAT AAAAAGAT 

EF052-4 (SEQ ID NO: 196) 

ENRETTEVGI GFTKTSDIPS KKNPWNVLP QTTIQSLSIV 
RSRTQIKR 



EF053-1 (SEQ ID NO: 197) 



TAGTCATGGC ACCATAACAA GGAGGAGAGA AGTGAGATGA AAAAATACCT TTTGCTTAGT 
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TGTTTTTTAG GTCTTTTCAG CTTCTGTCAT TCAGACACTG CGTTTGGAGA AGCAGCTTAT 
GAAAATAGTG GTGTTGTCTC CTTTTATGGA ACGTATGAAT ATCCCACAGA AGAGTCGACA 
ACAGCGACTA GTAATTCTTC CACAACGACC GAACCCACCA AGCCAGCTGA GGGAGGCGCT 
TCATCCGTCC TTTCTTCTGG CGTATATGGA TCGCGACAAG GAAGATTACC AGCGACAGGT 
ACCACCAATC AAGCACCATT TATTTATTTG GGAATCAGCC TTATCACTAT AGGCATATTA 
TTTATTAAAA GGAGAAGAGA AGATGAAAAA AACAGTATTA GCAGTAGTAG GGATTGTAGG 
ATTTAG 

EF053-2 (SEQ ID NO: 198) 

MKKYLLLSC FLGLFSFCHS DTAFGEAAYE NSGWSFYGT YEYPTEESTT 

ATSNSSTTTE PTKPADGGAS SVLSSGVYGS RQGRLPATGT TNQAPFIYLG ISLITIGILF 

IKRRREDEKN SISSSRDCRI 

EF053-3 (SEQ ID NO: 199) 

TTTGGAGA AGCAGCTTAT 

GAAAATAGTG GTGTTGTCTC CTTTTATGGA ACGTATGAAT ATCCCACAGA AGAGTCGACA 
ACAGCGACTA GTAATTCTTC CACAACGACC GAACCCACCA AGCCAGCTGA CGGAGGCGCT 
TCATCCGTCC TTTCTTCTGG CGTATATGGA TCGCGACAAG GAAGA 

EF053-4 (SEQ ID NO:200) 

FGEAAYE NSGWSFYGT YEYPTEESTT 
ATSNSSTTTE PTKPADGGAS SVLSSGVYGS RQGR 



EF054-1 (SEQ ID NO:201) 

TAAATAAAAA ATTATTTGGA GGAAATTACA ATGAAAAAAA TTATTTTATC AAGCTTGTTT 
AGTGCAGTAC TAGTATTCGG TGGCGGAAGT ATAACAGCAT TCGCTGACGA TTTAGGACCA 
ACAGATCCAG CAACTCCACC AATTACCGAA CCAACTGATT CTAGTGAACC TACCAATCCT 
ACTGAGCCGG TGGATCCTGC AGAACCGCCA GTAATACCAA CTGATCCAAC AGAACCAAGC 
AAGCCAACCG AGCCTACAAC ACCGAGTGAG CCAGAAAAGC CAACAGAACC AACAACGCCA 
ATTGATCCTG GAACGCCGGT TGAACCGACT GAACCAAGCG AGCCAACAGA ACCTAGTCAA 
CCAACCGAGC CTACAACACC AAGCGAACCA GAAAAACCTG TTACTCCAGA ACAACCGAAA 
GAACCAACTC AACCAGTGAT TCCAGAAAAA CCAGCAGAAC CAGAAACACC AAAAACTCCT 
GAACAGCCCA CTAAACCAAT AGACGTAGTC GTTACACCTA GTGGAGAAAT TGATAAAACG 
AATCAATCGG CAGGAACACA ACCAAGTATT CCTATTGAAA CAAGCAACTT AGCGGAGGTA 
ACACATGTAC CAAGTGAAAC TACTCCAATT ACAACAGAAG CTGGGGAAGA AATTGTAGCA 
GTAGATAAAG GTGTTCCGTT AACCAAAACA CCAGAAGGAT TAAAACCAAT TAGCAGCTCG 
TATAAGGTTT TACCTAGCGG AAACGTTGAG GTAAAAGCAA GTGATGGAAA AATGAAAGTA 
TTGCCACATA CAGGAGAGAA ATTCACACTC CTTTTCTCTG TATTGGGAAG CTTCTTTGTA 
TTAATTTCAG GATTCTTTTT CTTTAAAAAG AATAAGAAAA AAGCTTAA 

EF054-2 (SEQ ID NO:202) 

M KKIILSSLFS AVLVFGGGSI TAFADDLGPT DPATPPITEP TDSSEPTNPT 
EPVDPAEPPV IPTDPTEPSK PTEPTTPSEP EKPTEPTTPI DPGTPVEPTE PSEPTEPSQP 
TEPTTPSEPE KPVTPEQPKE PTQPVIPEKP AEPETPKTPE QPTKPIDVW TPSGEIDKTN 
QSAGTQPSIP lETSNLAEVT HVPSETTPIT TEAGEEIVAV DKGVPLTKTP EGLKPISSSY 
KVLPSGNVEV KASDGKMKVL PHTGEKFTLL FSVIjGSFFVL ISGFFFFKKN KKKA 



EFO.54-3 (SEQ ID NO: 203) 
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A 

ACAGATCCAG CAACTCCACC AATTACCGAA CCAACTGATT CTAGTGAACC TACGAATCCT 
ACTGAGCCGG TGGATCCTGC AGAACCGCCA GTAATACCAA CTGATCCAAC AGAACCAAGC 
AAGCCAACCG AGCCTACAAC ACCGAGTGAG CCAGAAAAGC CAACAGAACC AACAACGCCA 
ATTGATCCTG GAACGCCGGT TGAACCGACT GAACCAAGCG AGCCAACAGA ACCTAGTCAA 
CCAACCGAGC CTACAACACC AAGCGAACCA GAAAAACCTG TTACTCCAGA ACAACCGAAA 
GAACCAACTC AACCAGTGAT TCCAGAAAAA CCAGCAGAAC CAGAAACACC AAAAACTCCT 
GAACAGCCCA CTAAACCAAT AGACGTAGTC GTTACACCTA -GTGGAGAAAT TGATAAAACG 
AATCAATCGG CAGGAACACA ACCAAGTATT CCTATTGAAA CAAGCAACTT AGCGGAGGTA 
ACACATGTAC CAAGTGAAAC TACTCCAATT ACAACAGAAG CTGGGGAAGA AATTGTAGCA 
GTAGATAAAG GTGTTCCGTT AACCAAAACA CCAGAAGGAT TAAAACCAAT TAGCAGCTCG 
TATAAGGTTT TACCTAGCGG AAACGTTGAG GTAAAAGCAA GTGATGGAAA AATGAAAGTA 
T 

EF054-4 (SEQ ID NO: 204) 
DDLGPT DPATPPITEP TDSSEPTNPT 

EPVDPAEPPV IPTDPTEPSK PTEPTTPSEP EKPTEPTTPI DPGTPVEPTE PSEPTEPSQP 
TEPTTPSEPE KPVTPEQPKE PTQPVIPEKP AEPETPKTPE QPTKPIDVW TPSGEIDKTN 
QSAGTQPSIP lETSNLAEVT HVPSETTPIT TEAGEEIVAV DKGVPLTKTP EGLKPISSSY 
KVLPSGNVEV KASDGKMKV 



EF055-1 (SEQ ID NO:205). 

TAACAAAAGG TTGTTTTGTC TTTCTTGTGT AAAAGGGCAA GAAAGGCTAG CGAGTTAAAA 
GGAGGTTTTT CAATGAAAAA AAAGCGTTAT TTAATGATTG TGTGTCTACT ATCTTCTCCT 
AGTTTTTTTA TAAATGTTGA AGCGTCTGAT GGTGGTTCTA GTTCGGTGGG GATTGAATTT 
TACCAAAATC CGAGAACACC CGCTCCTAAA GATCCCCCAC CGAAAACAGA TGCGCCAGCT 
GCTGATCCCA AGGAACCAGC TGGTCCTCCG CAAGGAGATC AACGAAGTGG TGGTTCGACA 
CAGACCACCA CAACTGGCTC AACGCTCCCT CGTACAGGGA GCAAGAGTCA GGCAAATTTG 
AGCATTCTCN GNTTCGCCTT AATCGGTTTG GCGGGAATCG TACATAGAAA GAAGGGACGA 
CATGAAGCAA ACTAA 

EF055-2 (SEQ ID NO:206) 

MKKKRYL MIVCLLSSPS FFINVEASDG GSSSVGIEFY 

QNPRTPAPKD PPPKTDAPAA DPKEPAGPPQ GDQRSGGSTQ TTTTGSTLPR TGSKSQANLS 
ILXFALIGLA GIVHRKKGRH EAN 

EF055-3 (SEQ ID NO:207) 

AGCGTCTGAT GGTGGTTCTA GTTCGGTGGG GATTGAATTT 

TACCAAAATC CGAGAACACC CGCTCCTAAA GATCCCCCAC CGAAAACAGA TGCGCCAGCT 
GCTGATCCCA AGGAACCAGC TGGTCCTCCG CAAGGAGATC AACGAAGTGG TGGTTCGACA 
CAGACCACCA CAACTGGCTC AACG 



EF055-4 (SEQ ID NO: 208) 
SDG GSSSVGIEFY 

QNPRTPAPKD PPPKTDAPAA DPKEPAGPPQ GDQRSGGSTQ TTTTGST 



\ 
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EF056-1 (SEQ ID NO: 209) 

TAAATGAAAA AAAAGCGTTA TTTAATAATT GCGTGTTTAC TATTTTCCCC TAGTTTTTTT 
ATAAATGTTG AAGCATCTGA GGGTGGTTCT AGTTCGGTGG GAATTGAATT TTACCAAAAT 
CCGGCAACAC CCGCTCCTAA AGATGCCCCA CCGAAAACAG ATGAGCCAGC TGCGGATCCC 
AAGGAACCAG CTGGTCCTCT GCAAGGAGAT CAACGAAGTG GTGGTTCGAC ACAGACCACC 
ACAGCTGGCT CGCAGCTCCC TCGTACAGGA AGCAAGAGTC AGGCAAACCT GAGCATTCTT 
GGTCTTGTCT TGATTGGTCT TGTCGGAATG GTCCAGAGAA AGAAGGGACG ACATGAAGCA 
AACTAA 



EF056-2 (SEQ ID NO:210) 

MKKKRYLIIA CLLFSPSFFI NVEASEGGSS 
EPAGPLQGDQ RSGGSTQTTT AGSQLPRTGS 

EF056-3 (SEQ ID N0:211) 



SVGIEFYQNP ATPAPKDAPP KTDEPAADPK 
KSQANLSILG LVLIGLVGMV QRKKGRHEAN 



ATCTGA GGGTGGTTCT AGTTCGGTGG GAATTGAATT TTACCAAAAT 

CCGGCAACAC CCGCTCCTAA AGATGCCCCA CCGAAAACAG ATGAGCCAGC TGCGGATCCC 
AAGGAACCAG CTGGTCCTCT GCAAGGAGAT CAACGAAGTG GTGGTTCGAC ACAGACCACC 
ACAGCTGGCT CGCAG 

EF056-4 (SEQ ID NO:212) 

SEGGSS SVGIEFYQNP ATPAPKDAPP KTDEPAADPK 
EPAGPLQGDQ RSGGSTQTTT AGSQ 

EF057-1 (SEQ ID NO:213) 

TAATGTTTAT TGGCTGGGCC AGTCAATGTT GAAAATGGGG AAGGAGGAAT TCAGATGAAA 
ATCATAAAAA GGTTTAGTTT GGTATGTTTA GGGCTATTGA TCATTGGGTT GCNAACAAAA 
AGCGNTATGG CTGAAGAAAA TAATTATGAA TCAAATGGTC AAGCGAGCTT CTATGGTACC 
TACGTTTATG AGAATGAAAA AGAGTCAAAT GACGTAGCGT ATACCCAACA ATCAGAAGAA 
CAGGGAAGAA ACAATTTAGC TGCTTCTGGA CAAGCAGTTT TACCTAAAAC AGGCGAGTCT 
GAAAATCCGC TGTATTCCTT GATAGGAGTT AGTTTGTTGG GGATAGTCAT TTATTTAATT 
AATAAAATGA AACGAGAGAA GGAGTTTATT TAA 

EF057-2 (SEQ ID N0:214) 

MKI IKRFSLVCLG LLIIGLXTKS XMAEENNYES NGQASFYGTY 

VYENEKESND VAYTQQSEEQ GRNNLAASGQ AVLPKTGESE NPLYSLIGVS LLGIVIYLIN 
KMKREKEFI 

EF057-3 (SEQ ID NO:215) 

AAA TAATTATGAA TCAAATGGTC AAGCGAGCTT CTATGGTACC 

TACGTTTATG AGAATGAAAA AGAGTCAAAT GACGTAGCGT ATACCCAACA ATCAGAAGAA 
CAGGGAAGAA ACAATTTAGC TGCTTCTGGA CAAGCAGTTT 

EF057-4 (SEQ ID NO:216) 

EENNYES NGQASFYGTY 

VYENEKESND VAYTQQSEEQ GRNNLAASGQ AV 
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EF058-1 (SEQ ID N0:217) 

TGAAGAACGT TCTATTTGGT TGACGATTGC AGGCCTGCTA ATCATTGGGA TGGTAGTCAT 
TTGGCTATTT TATCAAAAAC AAAAAAGAGG AGAGAGAAAA TGAAGCAATT AAAAAAAGTT 
TGGTACACCG TTAGTACCTT GTTACTAATT TTGCCACTTT TCACAAGTGT ATTAGGGACA 
ACAACTGCAT TTGCAGAAGA AAATGGGGAG AGCGCACAGC TCGTGATTCA CAAAAAGAAA 
ATGACGGATT TACCAGATCC GCTTATTCAA AATAGCGGGA AAGAAATGAG CGAGTTTGAT 
AAATATCAAG GACTGGCAGA TGTGACGTTT AGTATTTATA ACGTGACGAA CGAATTTTAC 
GAGCAACGAG CGGCAGGCGC AAGCGTTGAT GCAGCTAAAC AAGCTGTCCA AAGTTTAACT 
CCTGGGAAAC CTGTTGCTCA AGGAACCACC GATGCAAATG GGAATGTCAC TGTTCAGTTA 
CCTAAAAAAC AAAATGGTAA AGATGCAGTG TATACCATTA AAGAAGAACC AAAAGAGGGT 
GTAGTTGCTG CTACGAATAT GGTGGTGGCG TTCCCAGTTT ACGAAATGAT CAAGCAAACA 
GATGGTTCCT ATAAATATGG AACAGAAGAA TTAGCGGTTG TTCATATTTA TCCTAAAAAT 
GTGGTAGCCA ATGATGGTAG TTTACATGTG AAAAAAGTAG GAACTGCTGA AAATGAAGGA 
TTAAATGGCG CAGAATTTGT TATTTCTAAA AGCGAAGGCT CACCAGGCAC AGTAAAATAT 
ATCCAAGGAG TCAAAGATGG ATTATATACA TGGACAACGG ATAAAGAACA AGCAAAACGC 
TTTATTACTG GGAAAAGTTA TGAAATTGGC GAAAATGATT TCACAGAAGC AGAGAATGGA 
ACGGGAGAAT TAACAGTTAA AAATCTTGAG GTTGGTTCGT ATATTTTAGA AGAAGTAAAA 
GCTCCAAATA ATGCAGAATT AATTGAAAAT CAAACAAAAA CACCATTTAC AATTGAAGCA 
AACAATCAAA CACCTGTTGA AAAAACAGTC AAAAATGATA CCTCTAAAGT TGATAAAACA 
ACACCAAGCT TAGATGGTAA AGATGTGGCA ATTGGCGAAA AAATTAAATA TCAAATTTCT 
GTAAATATTC CATTGGGGAT TGCAGACAAA GAAGGCGACG CTAATAAATA CGTCAAATTC 
AATTTAGTTG ATAAACATGA TGCAGCCTTA ACTTTTGATA ACGTGACTTC TGGAGAGTAT 
GCTTATGCGT TATATGATGG GGATACAGTG ATTGCTCCTG AAAATTATCA AGTGACTGAA 
CAAGCAAATG GCTTCACTGT CGCCGTTAAT CCAGCGTATA TTCCTACGCT AACACCAGGC 
GGCACACTAA AATTCGTTTA CTTTATGCAT TTAAATGAAA AAGCAGATCC TACGAAAGGC 
TTTAAAAATG AGGCGAATGT TGATAACGGT CATACCGACG ACCAAACACC ACCAACTGTT 
GAAGTTGTGA CAGGTGGGAA ACGTTTCATT AAAGTCGATG GCGATGTGAC AGCGACACAA 
GCCTTGGCGG GAGCTTCCTT TGTCGTCCGT GATCAAAACA GCGACACAGC AAATTATTTG 
AAAATCGATG AAACAACGAA AGCAGCAACT TGGGTGAAAA CAAAAGCTGA AGCAACTACT 
TTTACAACAA CGGCTGATGG ATTAGTTGAT ATCACAGGGC TTAAATACGG TACCTATTAT 
TTAGAAGAAA CTGTAGCTCC TGATGATTAT GTCTTGTTAA CAAATCGGAT TGAATTTGTG 
GTCAATGAAC AATCATATGG CACAACAGAA AACCTAGTTT CACCAGAAAA AGTACCAAAC 
AAACACAAAG GTACCTTACC TTCAACAGGT GGCAAAGGAA TCTACGTTTA CTTAGGAAGT 
GGCGCAGTCT TGCTACTTAT TGCAGGAGTC TACTTTGCTA GACGTAGAAA AGAAAATGCT 
TAA 



EF058-2 (SEQ ID NO:218) 

MKQLKKVW YTVSTLLLIL PLFTSVLGTT 
TAFAEENGES AQLVIHKKKM TDLPDPLIQN 
QRAAGASVDA AKQAVQSLTP GKPVAQGTTD 
VAATNMWAF PVYEMIKQTD GSYKYGTEEL 
NGAEFVISKS EGSPGTVKYI QGVKDGLYTW 
GELTVKNLEV GSYILEEVKA PNNAELIENQ 
PSLDGKDVAI GEKIKYQISV NIPLGIADKE 
YALYDGDTVI APENYQVTEQ ANGFTVAVNP 
KNEANVDNGH TDDQTPPTVE WTGGKRFIK 
IDETTKAATW VKTKAEATTF TTTADGLVDI 
NEQSYGTTEN LVSPEKVPNK HKGTLPSTGG 



SGKEMSEFDK YQGLADVTFS lYNVT^TEFYE 
ANGNVTVQLP KKQNGKDAVY TIKEEPKEGV 
AWHIYPKNV VANDGSLHVK KVGTAENEGL 
TTDKEQAKRF ITGKSYEIGE NDFTEAENGT 
TKTPFtlEAN NQTPVEKTVK NDTSKVDKTT 
GDANKYVKFN LVDKHDAALT FDNVTSGEYA 
AYIPTLTPGG TLKFVYFMHL NEKADPTKGF 
VDGDVTATQA LAGASFWRD QNSDTANYLK 
TGLKYGTYYL EETVAPDDYV LLTNRIEFW 
KGIYVYLGSG AVLLLIAGVY FARRRKENA 



EF058-3 (SEQ ID NO: 219) 
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AGAAGA AAATGGGGAG AGCGCACAGC TCC: 
ATGACGGATT TACCAGATCC GCTTATTCAA 
AAATATCAAG GACTGGCAGA TGTGACGTTT 
GAGCAACGAG CGGCAGGCGC AAGCGTTGAT 
CCTGGGAAAC CTGTTGCTCA AGGAACCACC 
CCTAAAAAAC AAAATGGTAA AGATGCAGTG 
GTAGTTGCTG CTACGAATAT GGTGGTGGCG 
GATGGTTCCT ATAAATATGG AACAGAAGAA 
GTGGTAGCCA ATGATGGTAG TTTACATGTG 
TTAAATGGCG CAGAATTTGT TATTTCTAAA 
ATCCAAGGAG TCAAAGATGG ATTATATACA 
TTTATTACTG GGAAAAGTTA TGAAATTGGC 
ACGGGAGAAT TAACAGTTAA AAATCTTGAG 
GCTCCAAATA ATGCAGAATT AATTGAAAAT 
AACAATCAAA CACCTGTTGA AAAAACAGTC 
ACACCAAGCT TAGATGGTAA AGATGTGGCA 
GTAAATATTC CATTGGGGAT TGCAGACAAA 
AATTTAGTTG ATAAACATGA TGCAGCCTTA 
GCTTATGCGT TATATGATGG GGATACAGTG 
CAAGCAAATG GCTTCACTGT CGCCGTTAAT 
GGCACACTAA AATTCGTTTA CTTTATGCAT 
TTTAAAAATG AGGCGAATGT TGATAACGGT 
GAAGTTGTGA CAGGTGGGAA ACGTTTCATT 
GCCTTGGCGG GAGCTTCCTT TGTCGTCCGT 
AAAATCGATG AAACAACGAA AGCAGCAACT 
TTTACAACAA CGGCTGATGG ATTAGTTGAT 
TTAGAAGAAA CTGTAGCTCC TGATGATTAT 
GTCAATGAAC AATCATATGG CACAACAGAA 
AAACACAAAG GTACCTTACC T 

EF058-4 (SEQ ID NO:220) 



X3ATTCA CAAAAAGAAA 

AATAGCGGGA AAGAAATGAG CGAGTTTGAT 
AGTATTTATA ACGTGACGAA CGAATTTTAC 
GCAGCTAAAC AAGCTGTCCA AAGTTTAACT 
GATGCAAATG GGAATGTCAC TGTTCAGTTA 
TATACCATTA AAGAAGAACC AAAAGAGGGT 
TTCCGAGTTT ACGAAATGAT CAAGCAAACA 
TTAGCGGTTG TTCATATTTA TCCTAAAAAT 
AAAAAAGTAG GAACTGCTGA AAATGAAGGA 
AGCGAAGGCT CACCAGGCAC AGTAAAATAT 
TGGACAACGG ATAAAGAACA AGCAAAACGC 
GAAAATGATT TCACAGAAGC AGAGAATGGA 
GTTGGTTCGT ATATTTTAGA AGAAGTAAAA 
CAAACAAAAA CACCATTTAC AATTGAAGCA 
AAAAATGATA CCTCTAAAGT TGATAAAACA 
ATTGGCGAAA AAATTAAATA TCAAATTTCT 
GAAGGCGACG CTAATAAATA CGTCAAATTC 
ACTTTTGATA ACGTGACTTC TGGAGAGTAT 
ATTGCTCCTG AAAATTATCA AGTGACTGAA 
CCAGCGTATA TTCCTACGCT AACACCAGGC 
TTAAATGAAA AAGCAGATCC TACGAAAGGC 
CATACCGACG ACCAAACACC ACCAACTGTT 
AAAGTCGATG GCGATGTGAC AGCGACACAA 
GATCAAAACA GCGACACAGC AAATTATTTG 
TGGGTGAAAA CAAAAGCTGA AGCAACTACT 
ATCACAGGGC TTAAATACGG TACCTATTAT 
GTCTTGTTAA CAAATCGGAT TGAATTTGTG 
AACCTAGTTT CACCAGAAAA AGTACCAAAC 



EENGES AQLVIHKKKM TDLPDPLIQN SGKEMSEFDK YQGLADVTFS lYNVTNEFYE 
QRAAGASVDA AKQAVQSLTP GKPVAQGTTD ANGNVTVQLP KKQNGKDAVY TIKEEPKEGV 
VAATNMWAF PVYEMIKQTD GSYKYGTEEL AWHIYPKNV VANDGSLHVK KVGTAENEGL 
NGAEFVISKS EGSPGTVKYI QGVKDGLYTW TTDKEQAKRF ITGKSYEIGE NDFTEAE^rGT 
GELTVKNLEV GSYILEEVKA PNNAELIENQ TKTPFTIEAN NQTPVEKTVK NDTSKVDKTT 
PSLDGKDVAI GEKIKYQISV NIPLGIADKE GDANKYVKFN LVDKHDAALT FDNVTSGEYA 
YALYDGDTVI APENYQVTEQ ANGFTVAVNP AYIPTLTPGG TLKFVYFMHL NEKADPTKGF. 
KNEANVDNGH TDDQTPPTVE WTGGKRFIK VDGDVTATQA LAGASFWRD QNSDTANYLK 
IDETTKAATW VKTKAEATTF TTTADGLVDI TGLKYGTYYL EETVAPDDYV LLTNRIEFW 
NEQSYGTTEN LVSPEKVPNK HKGT 



EF059-1 (SEQ ID NO: 221) 



TAGATTGGAA 
TTAGCAGGGG 
ACAACAGGGA 
GAGCCAGAGC 
ACCGAACCTA 
CCAACAGAGC 
GTACCAGAGC 



GAATGAAAAT 
GAAGCAGTGT 
GTGTTTTACC 
AACCAACAGA 
GTGAGCCTTC 
CAACAACGCC 
AACCAACAGA 



GAAAAAAATG 
TTCTGCTTAT 
AGATGAACCG 
GCCAAGTACA 
AAAACCGACG 
AAGTAAGCCA 
GCCAAGTGTA 



ATTATTATTG 
GCGCAAGAAT 
AATGTACCAA 
CCAGAGCAAC 
GATCCTTCGT 
GAGCAACCAA 
CCAGAAAAAC 



CCTTATTCAG 
CAGAAGGAAA 
CTGACCCAAT 
CATCGGAACC 
TACCAGACGA 
rAGAGCCAAC 
CAGTAGAACC 



TACAAGCCTT 
TCTTGGTGAA 
AACGCCAAGT 
GTCAACACCA 
ACCGAGCGTA 
AACGCCAAGT 
AAATAAACCA 
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ACCGAGCCAG AAAAGCCTGT GCCAGTTGTT CCTGAAAAAC CAGTTGTACC ACAACAACCA 
GAGCAACCAA CAGATGTGGT GGTAAAGCCA AATGGAGAAA TTGCAACAGG AGAATCTACA 
CAACAGCCAA CTGTTCCAAT TGAAACGAAT AACCTTTCAG AAGTAACACA TGTCCCAACT 
GTGACGACAC CGATTGAAAC AGCAAGCGGA GAAGCAATTG TCGCAGTGGA TAAGGGCGTT 
CCTTTAACAC AAACGGCTGA TGGATTAAAA CCGATTAAAA GTGAATATAA AGTATTACCA 
AGTGGCAATG TACAAGTGAA AAGTGCTGAC GGAAAAATGA AAGTACTTCC TTACACTGGT 
GAAAAAATGG GCATAATTGG GTCAATCGCT GGTGTATGTT TGACTGTTTT ATCAGGAATC 
TTAATTTATA AA7VAACGTAA AGTGTAG 

EF059-2 (SEQ IDNO:222) 

MKKMI IIALFSTSLL AGGSSVSAYA QESEGNLGET TGSVLPDEPN VPTDPITPSE 
PEQPTEPSTP EQPSEPSTPT EPSEPSKPTD PSLPDEPSVP TEPTTPSKPE QPTEPTTPSV 
PEQPTEPSVP EKPVEPNKPT EPEKPVPWP EKPWPQQPE QPTDVWKPN GEIATGESTQ 
QPTVPIETNN LSEVTHVPTV TTPIETASGE AIVAVDKGVP LTQTADGLKP IKSEYKVLPS 
GNVQVKSADG KMKVLPYTGE KMGIIGSIAG VCLTVLSGIL lYKKRKV 

EF059-3 (SEQ ID NO:223) 

AGAAGGAAA TCTTGGTGAA 

ACAACAGGGA GTGTTTTACC AGATGAACCG AATGTACCAA CTGACCCAAT 7UVCGCCAAGT 
GAGCCAGAGC AACCAACAGA GCCAAGTACA CCAGAGCAAC CATCGGAACC GTCAACACCA 
ACCGAACCTA GTGAGCCTTC AAAACCGACG GATCCTTCGT TACCAGACGA ACCGAGCGTA 
CCAACAGAGC CAACAACGCC AAGTAAGCCA GAGCAACCAA CAGAGCCAAC AACGCCAAGT 
GTACCAGAGC AACCAACAGA GCCAAGTGTA CCAGAAAAAC CAGTAGAACC AAATAAACCA 
ACCGAGCCAG AAAAGCCTGT GCCAGTTGTT CCTGAAAAAC CAGTTGTACC ACAACAACCA 
GAGCAACCAA CAGATGTGGT GGTAAAGCCA AATGGAGAAA TTGCAACAGG AGAATCTACA 
CAACAGCCAA CTGTTCCAAT TGAAACGAAT AACCTTTCAG AAGTAACACA TGTCCCAACT 
GTGACGACAC CGATTGAAAC AGCAAGCGGA GAAGCAATTG TCGCAGTGGA TAAGGGCGTT 
CCTTTAACAC AAACGGCTGA TGGATTAAAA CCGATTAAAA GTGAATATAA AGTATTACCA 
AGTGGCAATG TACAAGTGAA AAGTGCTGAC GGAAAAATGA AAGTAC 

EF059-4 (SEQ ID NO:224) 

EGNLGET TGSVLPDEPN VPTDPITPSE 
PEQPTEPSTP EQPSEPSTPT EPSEPSKPTD PSLPDEPSVP 
PEQPTEPSVP EKPVEPNKPT EPEKPVPWP EKPWPQQPE 
QPTVPIETNN LSEVTHVPTV TTPIETASGE AIVAVDKGVP 
GNVQVKSADG KMKV 

EF060-1 (SEQ ID NO:225) 

TGAAAAATAG ACAAGGAGCA CGCGATGATG ACAATGAAAA GTAAAGGGTC ACTTCTGGTG 
ACGTTGGGAA TACTTTTAAC CGTTGGCATT GCGAGTCTAA TTGTTTCTTC TGAGAGTTTT 
GCAGAAGAAG TAGGGCAAAC GAATATCGGT GTAACGTTCT ATGGAGGAAA AGAGCCACTA 
AAAACGGAAG GTGTCATTAA GCCAATAGAG CAACCAGTCA CTGATAAAGA TAAAAAAACG 
TCACAACAAC AAGACAAAGT GAGCAGAAAA ACCACTGCTA AAACGAATCC GACTAATGCA 
CAGACGTCAT TACCAAGGAC AGGTGAACGA AATAGCACGT GGCTTTACAG CCTTGGTATT 
GCCTGTTTAC TCGTAGTACT AACAAGTTTC TATTATTTGA ATAAAAAAAG GAAAAAGGAA 
AAATAA 

EF060-2 (SEQ ID NO:226) 

MMT MKSKGSLLVT LGILLTVGIA SLIVSSESFA EEVGQTNIGV TFYGGKEPLK 



TEPTTPSKPE QPTEPTTPSV 
QPTDVWKPN GEIATGESTQ 
LTQTADGLKP IKSEYKVLPS 
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TEGVIKPIEQ PVTDKDKKTS QQQDKVSRKT TAKTNPTNAQ TSLPRTGERN STWLYSLGIA 
CLLWLTSFY YLNKKRKKEK 

EF060-3 (SEQ ID NO:227) 

AGAAGAAG TAGGGCAAAC GAATATCGGT GTAACGTTCT ATGGAGGAAA AGAGCCACTA 
AAAACGGAAG GTGTCATTAA GCCAATAGAG CAACCAGTCA CTGATAAAGA TAAAAAAACG 
TCACAACAAC AAGACAAAGT GAGCAGAAAA ACCACTGCTA AAACGAATCC GACTAATGCA 
CAGACGTCAT 

EF060-4 (SEQ ID NO:228) 
EEVGQTNIGV TFYGGKEPLK 

TEGVIKPIEQ PVTDKDKKTS QQQDKVSRKT TAKTNPTNAQ TS 
EF061-1 (SEQ ID NO:229) 

TAATGGAACG ACCGACAGAA GAAGATTTTG AACTTACAAA TTAAAATTAA AATGGAGGAA 
ATAATGATGA AAAAAATTCT TTTTGCTAGT TTATTTAGTG CCACACTACT ATTTGGGGGA 
AGTGAAATTT CTGCTTTTGC ACAAGAAATT ATCCCTGATG ATACTACGAC ACCGCCCATT 
GAAGTACCAA CAGAACCAAG TACACCAGAA AAGCCAACAG ATCCAACACC GCCAATTGAG 
CCACCTGTAG ACCCTGTAGA GCCACCTATT ACACCAACGG AGCCAACAGA ACCGACAGAG 
CCGACAACAC CAACAGAACC TACAACTCCT ACAGAGCCAA GTGAACCAGA ACAACCAACG 
GAGCCAAGTA AACCAGTAGA ACCTGAAAAA CCAGTTACAC CAAGCAAACC AGCAGAACCC 
GAAAAAACTG TGACACCAAC TAAACCAACA GAATCTGAAA AACCAGTAGA ACCAGCAGAA 
CCAAGCAAGC CAATCGACGT TGTTGTAACG CCAACAGGGG AATTAAATCA CGCTGGAAAT 
GGTACACAAC AGCCAACAGT CCCTATTGAA ACAAGTAATT TGGCAGAAAT CACGCACGTG 
CCTAGTGTAA CAACACCTAT TACAACTACA GACGGAGAAA ACATTGTAGC TGTAGAAAAA 
GGTGTTCCAC TTACACAAAC AGCAGAAGGG TTAAAACCTA TTCAATCNAG TTACAAAGTA 
TTGCCTAGCG GAAATGTAGA AGTAAAAGGT AAGGACGGTA AAATGAAGGT TTTACCATAC 
ACAGGTGAAG AAATGAATAT CTTTTTATCT GCCGTAGCGG TATCTTGTCT GTAG 



EF061-2 (SEQ ID NO:230). 

MMKKILFASL FSATLLFGGS EISAFAQEII 
VPTEPSTPEK PTDPTPPIEP PVDPVEPPXT 
PSKPVEPEKP VTPSKPAEPE KTVTPTKPTE 
TQQPTVPIET SNLAEITHVP SVTTPITTTD 
PSGNVEVKGK DGKMKVLPYT GEEMNIFLSA 

EF061-3 (SEQ ID NO:231) 



PDDTTTPPIE 

PTEPTEPTEP TTPTEPTTPT EPSEPEQPTE 
SEKPVQPAEP SKPIDVWTP TGELNHAGNG 
GENIVAVEKG VPLTQTAEGL KPIQSSYKVL 
VAVSCL 



GAAATTT CTGCTTTTGC ACAAGAAATT ATCCCTGATG ATACTACGAC ACCGCCCATT 
GAAGTACCAA CAGAACCAAG TACACCAGAA AAGCCAACAG ATCCAACACC GCCAATTGAG 
CCACCTGTAG ACCCTGTAGA GCCACCTATT ACACCAACGG AGCCAACAGA ACCGACAGAG 
CCGACAACAC CAACAGAACC TACAACTCCT ACAGAGCCAA GTGAACCAGA ACAACCAACG 
GAGCCAAGTA AACCAGTAGA ACCTGAAAAA CCAGTTACAC CAAGCAAACC AGCAGAACCC 
GAAAAAACTG TGACACCAAC TAAACCAACA GAATCTGAAA AACCAGTAGA ACCAGCAGAA 
CCAAGCAAGC CAATCGACGT TGTTGTAACG CCAACAGGGG AATTAAATCA CGCTGGAAAT 
GGTACACAAC AGCCAACAGT CCCTATTGAA ACAAGTAATT TGGCAGAAAT CACGCACGTG 
CCTAGTGTAA CAACACCTAT TACAACTACA GACGGAGAAA ACATTGTAGC TGTAGAAAAA 
GGTGTTCCAC TTACACAAAC AGCAGAAGGG TTAAAACCTA TTCAATCNAG TTACAAAGTA 
TTGCCTAGCG GAAATGTAGA AGTAAAAGGT AAGGACGGTA AAATGAAGGT TT 
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EF061-4 (SEQ ID NO:232) 
QEII PDDTTTPPIE 

VPTEPSTPEK PTDPTPPIEP PVDPVEPPIT PTEPTEPTEP TTPTEPTTPT EPSEPEQPTE 
PSKPVEPEKP VTPSKPAEPE KTVTPTKPTE SEKPVQPAEP SKPIDWVTP TGELNHAGNG 
TQQPTVPIET SNLAEITHVP SVTTPITTTD GENIVAVEKG VPLTQTAEGL KPIQSSYKVL 
PSGNVEVKGK DGKMKV 

EF062-1 (SEQ ID NO:233) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
GATAATGTAC AAGCCGCGGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT 
AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATCAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT 
GATGCAGAAA AAGTGACGAT TCATTTATCC AAAGTGAAAG TTTATCAAGC AGAGGCAAGT 
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAA^FTA TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
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TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCCGC AAACAGGCGA AAAACAAAAT 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCTTG GCTTAGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 



EF062-2 (SEQ ID NO:234) 

MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
IIWEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK^ lAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANWPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSWLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTNTWNHI PKVXPKKDW 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGWEEW SISDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG WKITASQAF XDAMNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTWT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
WEKASWPE LPQTGEKQNV LLTVAGSLAA MLGLAGLGFK RRKETK 



EF062-3 (SEQ ID NO:235) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA 



AAACAATATA AGACATATAA AGCTAAGAAT 
AGTGTGTTAG GAGCCX3TAGG ATTAGCTACT 
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GATAATGTAC AAGCCGCGGA ATTAGATACG 
AACCCCGACC TGCAGTCAGA AAAGGAAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT 
GGAGCTGAAA AATCAGCACA AGAACAACCA 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT 
AACATTACCG TTGTTGAAAA ACCAGCAGAA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN 
AAGAACGAAA ACAGCTATGT CAATGAAGCG 
GTCGTGACGA AAGACACTAA AATTTCGTCG 
GATTTTAATA AAGTAAATGC AGGGGATTCA 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA 
GGACTAAACG CTAGTTATTT AGGACGTAAA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG 
GATTTTGGGG CNAACAATGC GTTCAAATAC 
GATGGAAAAT TTTACTCACC GGAAGATATT 
AATAGTGATT GGGACGCTGT AGGTCACAAG 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN 
ATTTTCAATT ATGGGAATCC AAAAGAACCA 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN 
NTCAATGATT TAAATGTGAA NCGTGGCGAT 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA 
GATGCAGAAA AAGTGACGAT TGATTTATCC 
CTNAACGANA AAGACTNAAA AGCTGTTGCT 
GTGACTGCTT CTTATGANCT CAATTTAGAT 
AACGCNGACG GNTCNGTTGT TTTAGCAATG 
GTAGTGAAAA - ATGTAGAAGG CGATTTTGAA 
GAAACGGTAA CAAATACAGT GATTAACCAT 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT 
CAAACAAAAA TTTATTATGA AGTGAAATCT 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG 
TGGCACGCTA TTACNAANTA TGACCTTAAA 
GATATTTCTG CCTACATTCT TTTAGAAAAC 
AATCAAGCAT TATTGGCNGC NTTAAATGAA 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT 
AAACCAACCA AAGCCGTTCA TAACAAGAAA 
CGTGGTGATG TTCTTTCTTA TGAAATGACN 
GCCTTTGATA CAGTCGATCT TGCGACAGGC 
AANGTGACAC CAATCAAAGA CTTACTTCGT 
AACCAGTTCA CGATCTCNTG GGACGATGCC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG 
GTCAAAGCCG ATGTTTCTG6 NGAOXSTTTAT 
CGAATTAAAA CCAATACNGT TGTCAACCAT 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA 
TTCTTCTATG AATTTACAAG TAGTGACATT 
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CAACCAGAAA CAACGACGGT TCAACCCAAT 
CCTAAAACGG CAGTATCTGA AGAAGCAACA 
AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GATACCACAA ACGCGCAACA ACCAACAGTA 
GTAGTAAGCC CTGAAACAAC CAATGAACCT 
GAAAATGAAG TGAATAAATC AACGTCCATT 
AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GANAAAGAAG TCGCNGAATA CAACAAGCAT 
ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
AAAGATATCT TTACAAAATT ACGGAAAGAT 
AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
AAAAATAAAC CAGTGACAGT GACCTATACA 
ATTACAAAAG CAGAATTTGT TTATGAACTA 
AATGCAGTAT TTTCAAACGA TCCGATTATC 
GGTAAGGATG TTAAAACACG CTTAACGATT 
CTACCAGATA AAGATAGTCC ATTTGCGTAT 
AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
ATGACAACAA AAGGAAAAAG TAATGTGCCT 
ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
GAAAAAGCAA CGATTGAATT CAATNGATAC 
AATAAAGAAG TCACOXSATGG NCAGAAAAAT 
TCTTTACAAT ACATTGTGAC AGGGGATACG 
GTAACNAAAC AAGGGATTCG AGATACNTTT 
AAAGTGAAAG TTTATCAAGC AGACCCAAGT 
GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
CAAAACACCG TCACAGCAAT GATGAAAACC 
GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
AATACAGCTG TTCAGCTGAC AAANGATGGN 
GTGCCAGGTA GTAATCCTTC CAAAGAOXSTA 
GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
TCCGAACGTC CAGCNAACTA TGGCGGAATN 
GACACGACCC ATGATCGTTT CACAGGNAAA 
GTAGGGGANA AAACGTTAAA AGCAGGAACA 
AAAGACAATA AAGACTTGAC GTTTACNATG 
GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
ACAGGTGACG TAGAAAACAC GCAAACAGAA 
ACNGTGGTGA CGCATACNCC TGATGATCCA 
GGGGAAGANA TTAANCATGG AAAAGTNGCT 
TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GTTTCTTTCT TCGATGATTA CGATGAAACG 
GTCAAAGATT CTAAAGGGGN AGACATTACG 
AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CAAGAATTGC GTGTAACNCT CCCTACAAAA 
AATTCAGCGG AACAAAATAC ATTTGGNCAA 
ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
AATGGNGCCA CAATCAAATT AGGGGAGAAN 
CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
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TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TG 



EF062-4 (SEQ ID NO:236) 
AELDTQ PETTTVQPNN 

PDLQSEKETP KTAVSEEATV QKDTTSQPTK 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE 
ITWEKPAED LGNVSSKDtiA AKEKEVDQLQ 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD 
ISAYILLENK DNKDLTFTMN QALLAALNEG 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG 
GVERIAAGDV YNTIEESFNN EKIKTNTWT 
WEKASV 

EF063-1 (SEQ ID NO: 237) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA 
GATAATGTAC AAGCCGCGGA ATTAGATACG 
AACCCCGACC TGCAGTCAGA AAAGGAAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT 
GGAGCTGAAA AATCAGCACA AGAACAACCA 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT 
AACATTACCG TTGTTGAAAA ACCAGCAGAA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN 
AAGAACGAAA ACAGCTATGT CAATGAAGCG 
GTCGTGACGA AAGACACTAA AATTTCGTCG 
GATTTTAATA AAGTAAATGC AGGGGATTCA 



VEEVAPENKG TEQSSATPND TTNAQQPTVG 
NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
KEQAKKIAQQ AAELKAKNEK lAKENAEIAA 
SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
SFVKEANIiGS NGGYAVLLEK NKPVTVTYTG 
AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
KATIEFNXYK ANWPVLVPN KEVTDGQKNX 
TKQGIRDTFD AEKVTIDLSK VKVYQADASL 
NTVTAMMKTN ADGSWLAMG YKYLLVLPFV 
PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
SAEQNTFGQR IKTNTWNHI PKVXPKKDW 
AEYAGWEEW SISDKLDVKH DKFSGQWSVF 
WKITASQAF XDAMNLKENK NVAHSWKAFI 
HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 



AAACAATATA AGACATATAA AGCTAAGAAT 
AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
CAACCAGAAA CAACGACGGT TCAACCCAAT 
CCTAAAACGG CAGTATCTGA AGAAGCAACA 
AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GATACCACAA ACGCGCAACA ACCAACAGTA 
GTAGTAAGCC CTGAAACAAC CAATGAACCT 
GAAAATGAAG TGAATAAATC AACGTCCATT 
AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GANAAAGAAG TCGCNGAATA CAACAAGCAT 
ATTAGTAAAA ACCTAGTGTT CGATGAATCT 
ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
AAAGATATCT TTACAAAATT ACGGAAAGAT 
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ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT. 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
^P^CAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT 
GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCCGC AAACAGGCGA AAAACAAAAT 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCTTG GCTTAGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 



EF063-2 (SEQ ID NO:238) 
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MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITWEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK lAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANWPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSWLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTNTWNHI PKVXPKKDW 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGWEEW SISDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG WKITASQAF XDAMNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTWT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
WEKASWPE LPQTGEKQNV LLTVAGSLAA MLGLAGLGFK RRKETK 



EF063-3 (SEQ ID NO:239) 

GGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT 

AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTCAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
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AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATAC^P^TT 
GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTG 



EF063-4 {SEQ ID NO:240) 
ELDTQ PETTTVQPNN 

PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITWEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK lAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANWPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK V 

EF064-1 (SEQ ID NO:241) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
GATAATGTAC AAGCCGCGGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT 
AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGGTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT 
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GATGCAGAAA 
CTNAACGANA 
GTGACTGCTT 
AACGCNGACG 
GTAGTGAAAA 
GAAACGGTAA 
AAAGCAGATA 
CAAACAAAAA 
ACNGAAGAAT 
TGGCACGCTA 
GATATTTCTG 
AATCAAGCAT 
TCTGTGTATC 
AACTACAACA 
AAACCAACCA 
CGTGGTGATG 
GCCTTTGATA 
AANGTGACAC 
AACCAGTTCA 
CCACAAGCCT 
GTCAAAGCCG 
CGAATTAAAA 
GTTATTAAAG 
TTCTTCTATG 
TGGTCGATTA 
TTTGCCAATT 
TCGAAACTAT 
TTTTTNGATG 
ATTGGTGTAG 
AATGAGAAGA 
CCAGAAAAAA 
TTAGTGGTAG 
GTCTTATTAA 
AAACGTAGAA 



AAGTGACGAT 
AAGACTNAAA 
CTTATGANCT 
GNTCNGTTGT 
ATGTAGAAGG 
CAAATACAGT 
AAAACGGTAC 
TTTATTATGA 
GGGGCATGAA 
TTACNAANTA 
CCTACATTCT 
TATTGGCNGC 
TGGAAGTCGA 
AAGAGCTTGT 
AAGCCJTTCA 
TTCTTTCTTA 
CAGTCGATCT 
CAATCAAAGA 
CGATCTCNTG 
TTATTCTAGC 
ATGTTTCTGG 
CCAATACNGT 
TNGGTGACAA 
AATTTACAAG 
GCGATAAACT 
CTAATTTTGT 
TCACGATGAC 
CGATGAATCT 
AACGAATTGC 
TTAAAACNAA 
CAGTGATTGT 
AAAAGGCAAG 
CGGTAGCTGG 
AAGAAACAAA 



TGATTTATCC 
AGCTGTTGCT 
CAATTTAGAT 
TTTAGCAATG 
CGATTTTGAA 
GATTAACCAT 
AGTTGGCAGT 
AGTGAAATCT 
TGATGTCTTG 
TGACCTTAAA 
TTTAGAAAAC 
NTTAAATGAA 
ACGGATNAAA 
NCGTTCTAAT 
TAACAAGAAA 
TGAAATGACN 
TGCGACAGGC 
CTTACTTCGT 
GGACGATGCC 
GNATGGTGGG 
NGATGTTTAT 
TGTCAACCAT 
ACAAAGTCAA 
TAGTGACATT 
AGACGTCAAA 
TTTAGCAGAC 
CTTTGAACAA 
AAAAGAAAAC 
GGCAGGAGAC 
TACGGTAGTG 
ACCACCAACA 
TGTNGTGCCA 
TAGTTTAGCC 
ATAA 



AAAGTGAAAG 
GCAGCNATTA 
CAAAACACCG 
GGGTATAAAT 
AATACAGCTG 
GTGCCAGGTA 
GTTTCTCTAC 
TCCGAACGTC 
GACACGACCC 
GTAGGGGANA 
AAAGACAATA 
GGAAGCAATA 
ACAGGTGACG 
ACNGTGGTGA 
GGGGAAGANA 
TGGGACTTAA 
GTTTCTTTCT 
GTCAAAGATT 
AAAGGCACGG 
CAAGAATTGC 
AATTCAGCGG 
ATTCCAAAAG 
AATGGNGCCA 
CCTGCAGAAT 
CATGACAAAT 
GGAACCAAAG 
GGGGTAGTGA 
AAAAACGTTG 
GTTTACAACA 
ACNCATACGC 
CCAAAAACAC 
GAATTGCCGC 
GCAATGCTTG 



TTTATCAAGC 
ATTCAGGAAN 
TCACAGCAAT 
ATTTACTTGT 
TTCAGCTGAC 
GTAATCCTTC 
ATGATAAAGA 
CAGCNAACTA 
ATGATCGTTT 
AAACGTTAAA 
AAGACTTGAC 
AAGTAGGCAA 
TAGAAAACAC 
CGCATACNCC 
TTAANCATGG 
AAGGGTACGA 
TCGATGATTA 
CTAAAGGGGN 
TGACNATNTC 
GTGTAACNCT 
AACAAAATAC 
TGAANCCTAA 
CAATCAAATT 
ACGCTGGNGT 
TTAGTGGCCA 
TGAATAAAGG 
AAATCACGGC 
CACACTCATG 
CAATCGAAGA 
CAGAAAAACC 
CGCAAGCACC 
AAACAGGCGA 
GCTTAGCAGG 



AGACGCAAGT 
AGCTAAAGAC 
GATGAAAACC 
CTTGCCGTTT 
AAANGATGGN 
CAAAGATGTA 
TATTCCGTTA 
TGGCGGAATN 
CACAGGNAAA 
AGCAGGAACA 
GTTTACNATG 
ACAAGCTTGG 
GCAAACAGAA 
TGATGATCCA 
AAAAGTNGCT 
TAAAGACTTT 
CGATGAAACG 
AGACATTACG 
TGCCAAAGAC 
CCCTACAAAA 
ATTTGGNCAA 
AAAAGACGTG 
AGGGGAGAAN 
TGTGGAAGAA 
ATGGTCTGTG 
GGACGACATT 
CAGTCAAGCC 
GAAAGCGTTC 
ATCTTTCAAC 
ACAAACNCCA 
AGTAGAGCCA 
AAAACAAAAT 
CTTAGGCTTT 



EF064-2 (SEQ ID N0:242) 

MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITWEKPAED IiGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK lAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK AN\ArPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK VKVY<3ADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ ^rTVTAMMKTN ADGSWLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
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FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTOTWNHI PKVXPKKDW 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGWEEW SISDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG WKITASQAF XDAMNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTWT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
WEKASWPE LPQTGEKQNV LLTVAGSLAA MLGLAGLGFK RRKETK 

EF064-3 (SEQ ID NO:243) 

AGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
GTAGTGAAAA ATOTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCGGAACGTC CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCGGC AAACAGGCGA AAAACAAAAT 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCTTG GCTTAGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 



EF064-4 (SEQ ID NO: 244) 

VTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD 
ISAYILLENK DNKDLTFTMN QALLAALNEG 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN 



NTVTAMMKTN ADGSWLAMG YKYLLVLPFV 
PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
SAEQNTFGQR IKTNTWNHI PKVXPKKDW 



wo 98/50554 



PCT/US98/089S9 



146 

TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes. 



IKVGDKQSQN GATIKLGEXF 
ANSNFVLADG TKVNKGDDIS 
GVERIAAGDV YNTIEESFNN 
WEKASV 



FYEFTSSDIP AEYAGWEEW 
KLFTMTFEQG WKITASQAF 
EKIKTNTWT HTPEKPQTPP 



SISDKLDVKH DKFSGQWSVF 
XDAMNLKENK NVAHSWKAFI 
EKTVIVPPTP KTPQAPVEPL 



EF065-1 (SEQ ID NO:245) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA 
AGTCTGGCTG ATTGTAAACG GATATTGGAA 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT 
TTTCCACATG TAAGACAAGC GATTGATGAA 
GTGATGCTGG CTTCATATCG CGGCGGAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT 
GGATTGAAAC TCGCTTTAGA TACGTACAAT 
ACGTATTTCG TATTAGTGAC AGATGGGGTC 
AAGACCAATA CCAATGATTC AATCAATGAA 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT 
GAAATTACTA ACCAAGGCTA TGAAATGATT 
AGTTCAGTGA ATTCATACTT TGATAAATAT 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA 
TTTACAACCC AATTAAAACA AATTGTCAAA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT 
GGAAATGATG TGCCTGTTCA AATTAACGGA 
TACGTAGGAA ACATCACGAT TCACTACGAA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA 
ACGATTCCTA AAAATGaCAA TGCGCATGCG 
ACAAAAGATA TCGAAAATCA AGAACACTTA 
TGGCATGTCA AAACAGCCTT TGGCAACGAA 
GATGACATTA ATAAAGTGCT AGATATCATT 
GATGTTACAG CTAACGGCAC AGTAACACAA 
AAACAAGCAG ACAGCTATGA CTATTTAAGT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT 
ACCGTAACAC CACCGCCAGT TGATCCAAAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT 
GAAACAAGCA CTTGGACCCA AGCCAGCATG 
ACTGATGTAA AAGTCACAGA TGAAAATGGT 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG 
AGTGGTCATA CGTACACAAT GACCATTACT 
GAATTAGCAC CTTATATTGA ACAAGGTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA 
GAAGATCCAA CGATTACAAA AGATATCGAA 
GACCAAGAAT TTAAATGGAA CGTCAAAACA 
CAAGCCAGCA TGGTGGATGA CATTAATAAA 
GANGAAAATG GCAAAGATGT TACAGATAAT 
ACTTTTACTA TGAACAAAAA AGATGACAGC 
ATGACTATTA CCACTAAAAT TAAAACTGAC 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC 
CATTCCAACA AGCCAACCGT AACACCGCCT 



TTTAAGAAAG CAACGAAATT ATTATCGACA 
AATTTCAGTC CCACATTGGC TTTAGCTGAA 
ATGACCAATA CGGTGAAAGT GAAAGACGAC 
GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
CAATTTATGT TTCCTGATGG AAAGACAAAA 
CGCGTCAATA CGCAATTGAC TTATGATAAA 
CGGACGTATG GTGGTACGCC AACCGCCCCA 
CAAACACACG GAGATTTAAC GAATCGAAAA 
GCTAATACAC GTTTAGATGG TTACTTGCAT 
TATCCAGATC CAAGACATCC TCTTCAAGTC 
GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GATTTTATTA CAAGCCAATC TATTGATGAT 
GATCGTCTGG CGCAATCGAC ACCAGCAACA 
ATTCAATCTG CGACCGCTAC GGACGATGCT 
CAAACCATTT CAGCAACTAG TACAGAAGGT 
GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
GGAACAATTG CTAAGGAATT TCCAGAAGCG 
TGTGACGTGA CGCCAGAAGA TCCAACGATT 
GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGTGAAAG TCACCGACGA AAATGGTAAA 
GAAAATAACA AAGTAACTTT TGAAATGAAC 
GGTCATACGT ATACAATGAC TATCACCACT 
TTAGCGCCTT ACATTGAACA AGGCGGGATT 
GAAGGTGACG TGTTACATTC CAACAAACCA 
ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GTAGATGACA TTAATAAAGT GTTAGACATC 
AAAGATGTTA CAGCTAAC<3G CAAAGTAACA 
AACAANCAAG CNGACAGCTA TGACTATTTA 
ACTAAAATCA AAGCTAGCGC AACGGACGAA 
ATTCCCAACC AAGCCGACTT GAACTTTGGC 
CCAACCGTAA CACCACCTGC ACCAACGGCA 
GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GCTTTCGGTA ACGAAACAAG CACATGGACC 
GTGTTAGACA TCACACACGT GAAAGTTNCT 
GGCATAGTAA CACAAGAAAA TAACAAAGTA 
TACTCTTACT TAGCTGGTCA TACATACACA 
GCAACGGATG AAGAATTAGC GCCTTATATT 
TTAAACTTTG GCAACGAAGG TGACGTGTTG 
GCACCAACGC CAGAAGACCC AAAAAAACCT 
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GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF065-2 (SEQ ID NO:246) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 



EF065-3 (SEQ ID NO:247) 

GGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
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CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TT 

EF065-4 (SEQ ID NO:248) 

AVKAGDTEGM TNTVKVKDDS 

LADCKRILEG QATFPVQAGE TEPVDLWVE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLKTFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIH 

EF066-1 (SEQ ID NO:249) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
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GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGAGGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCA-^^CCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT -GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF066-2 (SEQ ID NO:250) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENFTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FIMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
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PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 

EF066-3 (SEQ ID NO:251) 

GGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGAGGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCA 

EF066-4 (SEQ ID NO:252) 

AVKAGDTEGM TNTVKVKDDS 

LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVT 

EF067-1 (SEQ ID NO:253) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATOXSGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
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AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF067-2 (SEQ ID NO:254) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLWVE DASGSFSDNF PHVRQAIDEV VQGLSE3QDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGPGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK ITmiDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYIMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
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ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 

EF067-3 (SEQ ID NO:255) 

GCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 

GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC. TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CAITAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TT 

EF067-4 (SEQ ID NO:256) 

VLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDI^FKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIH 



EF068-1 (SEQ ID NO:257) 

TAGGGGAAGC TAATGATCTT GGTATTTATC 
ATGAAAAAGA AAATTGTTGA GGATTTTAAT 
CGCAAGATGC TTAATTTAGC AATATCAAGT 
GTAAGTATAG CTGTTACCTC TGGCACAATC 
CTATTATCAA ATGTTACGTC AAATAATGAC 
GCCGCAAACC AAAATCAACC AGTTAATTTC 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC 



GTTTATTTTA AAGAAAAGAG GGACGATCAG 
CGGAAAAGTC AGCATAAAAA ATGGACAAAA 
GGTTTATTAT TTACGTCATT AGCAATCCCT 
AGTGCATCAG CAGCGGTCTT GGATATGGAA 
AGTGGCACTT CAACGAGTAA TC-GTTGGACA 
ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATACCAATG TCACGATTGA TCTTTCAAAA 
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GTTACTTTTT TGACTGCCGT TTTGAATGCA GCCAATGATT TAACCAATGT <3ATTACTC7VA 
ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT 
CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATCCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCAGG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGC<3GATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGGGATGC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATGGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGGGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCAC<3GTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
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ACTGTCGTAG CGAAAAATGC AAGAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 
GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA 
ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC 
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACTTAC CTAGCACAGG TGAAAAAGAG TCTTCAGCCG TGACAACAAG TTTGTTTGGC 
GCCTTTGTCG CACTCCTTGC GAGCATGGGA ATCATCAAAC GCAAACGTAA AAACTAG 

EF068-2 (SEQ ID NO:258) 

M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLFTSLAIPV 

SIAVTSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LWPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI 
TSGAIiGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG 
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TWQTDLLDV NLLATADGVS 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
VKGTAEVGTT lEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSW AKNASGTESQ 
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
QTPADEWAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT 
LASGKATAKQ TVNWAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN 
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT WGKDGDGNE SQPTEVTVPE 
DATVAAPTVT TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
TATANEALTA lAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGSTTNG YQWGAAEVG 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT WAKNATGKE SQPATATTPV 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTII AATTANETGQ YTVTLPAGW 
TPGETITIIS KDGAGNESQP ATAVIPADW LAAPTITKVE GNKANGYTVT GTADPNVTVQ 
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADITG 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 
YLPSTGEKES SAVTTSLFGA FVALLASMGI IKRKRKN 
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TABLE 1 . Nucleotide and Amino Acid Seqeuences of E. faecalis Genes. 
EF068-3 (SEQ ID NO:259) 

CTC TGGCACAATC AGTGCATCAG CAGCGGTCTT GGATATCGAA 

CTATTATCAA ATGTTACGTC AAATAATGAC AGTGGCACTT CAACGAGTAA TC<3TTGGACA 
GCCGCAAACC AAAATCAACC AGTTAATTTC ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC AATACCAATG TCACGATTGA TCTTTCAAAA 
GTTACTTTTT TGACTGCCGT TTTGAATGCA GCCAATGATT TAACCAATGT GATTACTCAA 
ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT 
CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTl^ ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCGGATGC CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC 



EF068-4 (SEQ ID NO:260) 

TSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LWPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI 
TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG 
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TWQTDLLDV NLLATADGVS 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTP 

EF069-1 (SEQ ID NO:261) 

TAGGGGAAGC TAATGATCTT GGTATTTATC GTTTATTTTA AAGAAAAGAG GGACGATCAG 
ATGAAAAAGA AAATTGTTGA GGATTTTAAT CGGAAAAGTC AGCATAAAAA ATGGACAAAA 
CGCAAGATGC TTAATTTAGC AATATCAAGT GGTTTATTAT TTACGTCATT AGCAATCCCT 
GTAAGTATAG CTGTTACCTC TGGCACAATC AGTGCATCAG CAGCGGTCTT GGATATCGAA 
CTATTATCAA ATGTTACGTC AAATAATGAC AGTGGCACTT CAACGAGTAA TCGTTGGACA 
GCCGCAAACC AAAATCAACC AGTTAATTTC ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG GTGTTAGTGG TTCCTCCTGA <3TTAAGAGGA 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC AATACCAATG TCACGATTGA TCTTTCAAAA 
GTTAC T T TTT TGACTGCCGT TTTGAATGCA GCCAATGATT TAACCAATGT GATTACTCAA 
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ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT 
CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCC<3ATGC CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 
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GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA 
ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC 
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCAC<3GTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACTTAC CTAGCACAGG TGAAAAAGAG TCTTCAGCCG TGACAACAAG TTTGTTTGGC 
GCCTTTGTCG CACTCCTTGC GAGCATGGGA ATCATCAAAC GCAAACGTAA AAACTAG 

EF069-2 (SEQ ID NO:262) 

M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLFTSLAIPV 

SIAVTSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LWPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI 
TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG 
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TWQTDLLDV NLIjATADGVS 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
VKGTAEVGTT lEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSW AKNASGTESQ 
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATC TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
QTPADEWAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT 
LASGKATAKQ TVNWAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN 
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT WGKDGDGNE SQPTEVTVPE 
DATVAAPTVT TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTDGA ^AFTITLPTG 
TATANEALTA lAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGSTTNG YQWGAAEVG 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT WAKNATGKE SQPATATTPV 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTII AATTANETGQ YTVTLPAGW 
TPGETITIIS KDGAGNESQP ATAVIPADW LAAPTITKVE GNKANGYTVT GTADPNVTVQ 
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADITG 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
' TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 
YLPSTGEKES SAVTTSLFGA FVALLASMGI IKRKRKN 

EF069-3 (SEQ ID NO:263) 
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AGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 

GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTOXSC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAA 



EF069-4 (SEQ ID NO:264) 



AGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
VKGTAEVGTT lEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSW AKNASGTESQ 
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
QTPADEWAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT 
LASGKATAKQ TVNWAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN 
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT WGKDGDGNE SQPTEVTVPE 
DATVAAPTVT TVTGT 



EF070-1 (SEQ ID NO:265) 

TAGGGGAAGC TAATGATCTT GGTATTTATC 
ATGAAAAAGA AAATTGTTGA GGATTTTAAT 
CGCAAGATGC TTAATTTAGC AATATCAAGT 
GTAAGTATAG CTGTTACCTC TGGCACAATC 
CTATTATCAA ATGTTACGTC AAATAATGAC 
GCCGCAAACC AAAATCAACC AGTTAATTTC 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC 
GTTACTTTTT TGACTGCCGT TTTGAATGCA 
ATTACCAGTG GGGCGTTAGG GAATTTAACT 



GTTTATTTTA AAGAAAAGAG GGACGATCAG 
CGGAAAAGTC AGCATAAAAA ATGGACAAAA 
GGTTTATTAT TTACGTCATT AGCAATCCCT 
AGTGCATCAG CAGCGGTCTT tSGATATCGAA 
AGTGGCACTT CAACGAGTAA TCGTTGGACA 
ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
GTGTTAGTGG TTCCTGCTGA GTTAAGAGGA 
AATACCAATG TCACGATTGA TCTTTCAAAA 
GCCAATGATT TAACCAATGT GATTACTCAA 
GGTGTTGATA TTGATTTGAC <3GAAGTGAAT 
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CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGAGGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC A<3CTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 
GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA 
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ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC 
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTCACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG -CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACTTAC CTAGCACAGG TGAAAAAGAG TCTTCAGCCG TGACAACAAG TTTGTTTGGC 
GCCTTTGTCG CACTCCTTGC GAGCATGGGA ATCATCAAAC GCAAACGTAA AAACTAG 

EF070-2 (SEQ ID NO:266) 

M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLFTSLAIPV 

SIAVTSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LWPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI 
TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV 
LAQNVSNIIiQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG 
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TWQTDLLDV NLLATADGVS 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
VKGTAEVGTT lEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSW AKNASGTESQ 
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
QTPADEWAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT 
LASGKATAKQ TVNWAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN 
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT WGKDGDGNE SQPTEVTVPE 
DATVAAPTVT TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
TATANEALTA lAKDAAGJCES NPTAFKTPAD PDAPVATPTV DKITGSTTNG YQWGAAEVG 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT WAKNATGKE SQPATATTPV 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTII AATTANETGQ YTVTLPAGW 
TPGETITIIS KDGAGNESQP ATAVIPADW LAAPTITKVE GNKANGYTVT GTADPNVTVQ 
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADITG 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 
YLPSTGEKES SAVTTSLFGA FVALLASMGI IKRKRKN 

EF070-3 (SEQ ID NO: 267) 

CGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
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GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 
GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA 
ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC 
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCOTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACT 



EF70-4 (SEQ ID NO:268) 
DGDGNE SQPTEVTVPE 

DATVAAPTVT TVTGTTATGY QVTGTAEPNV 
TATANEALTA lAKDAAGKES NPTAFKTPAD 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI 
TPGETITIIS KDGAGNESQP ATAVIPADW 
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP 
TAGQATAQQS LLATATDGAG HYSTATTFMT 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS 
YL 



TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
PDAPVATPTV DKITGSTTNG YQWGAAEVG 
GKASANETIT WAKNATGKE SQPATATTPV 
DVRDADGTII AATTANETGQ YTVTLPAGW 
LAAPTITKVE GNKANGYTVT GTADPNVTVQ 
EKETLTALTT DTQGNVSPKT TFMTPADITG 
NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
LGNGIiGTNGS GYHPKLSTIS YGTGNHGKTG 



EF071-1 (SEQ ID NO:269) 

TAAGTAGAAG TGGTCGGGAC AAACGTAGAA 
GTCCCGCCAT TTATCTGCAG GTTTAAGCCG 
ATGGCTTTTT TAAGAAAGGA GCATGCTATG 
GTGATTGGTT TAAGTTTAAC GATTCCGATG 
CCAATCAACT TTACTTATTT TCCCGGCTCT 
TCTGGAAACG AGCGGAACCT AGGACCACAC 
CGAAATTGGT CAAATGCTTA TGTCTCATAT 



CTTTCGCTGA TTGCCGAAGA AATTACTTCT 
TGGAAGGGAA GTTATTTTGA CTTTCCTTTC 
TTTAAAAAAT TAATGATTCA ACTTGCTTTA 
ACGGCTTNCG CTTACACCAT CGAAGCGGAT 
GCAAGCAATG AATTAATTGT TTTACATGAA 
AGTTTAGACA ATGAAGTGGC CTATATGAAA 
TTTGTCGGAT CTGGTGGACG AGTGAAACAA 
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TTAGCTCCTG CTGGCCAAAT TCAATATGGC GCAGGTTCTT TAGCTAATCA AAAAGCCTAT 
GCGCAAATCG AATTGGCTCG AACGAATAAT GCGGCGACAT TTAAAAAAGA TTATGCTGCC 
TATGTTAATT TGGCCCGTGA TTTGGCTCAG AACATTGGTG CTGATTTTTC TCTGGACGAT 
GGAACAGGTT ATGGCATAGT CACTCATGAT TGGATTACAA AAAATTGGTG GGGAGATCAT 
ACAGATCCTT ATGGTTATTT AGCGCGTGGG GGATTAGTAA AGCGCATTGG CACNAGATTT 
ACAACGGGCG TTTCNGNAAC AGGTGAGACT GGTCATTATT CAGCCAGGTA A 



EF071-2 (SEQ ID NO:270) 
MF KKLMIQLALV 

IGLSLTIPMT AXAYTIEADP INFTYFPGSA 
NWSNAYVSYF VGSGGRVKQL APAGQIQYGA 
VNLARDLAQN IGADFSLDDG TGYGIVTHDW 
TGVSXTGETG HYSAR 

EF071-3 (SEQ ID NO:271) 



SNELIVLHES GNERNLGPHS LDNEVAYMKR 
GSLANQKAYA QIELARTNNA ATFKKDYAAY 
ITKNWWGDHT DPYGYLARGG LVKRIGTRFT 



G TTTAAAAAAT TAATGATTCA ACTTGCTTTA 

GTGATTGGTT TAAGTTTAAC GATTCCGATG ACGGCTTNCG CTTACACCAT CGAAGCGGAT 
CCAATCT^CT TTACTTATTT TCCCGGCTCT GCAAGCAATG AATTAATTGT TTTACATGAA 
TCTGGAAACG AGCGGAACCT AGGACCACAC AGTTTAGACA ATGAAGTGGC CTATATGAAA 
CGAAATTGGT CAAATGCTTA TGTCTCATAT TTTGTCGGAT CTGGTGGACG AGTGAAACAA 
TTAGCTCCTG CTGGCCAAAT TCAATATGGC GCAGGTTCTT TAGCTAATCA AAAAGCCTAT 
GCGCAAATCG AATTGGCTCG AACGAATAAT GCGGCGACAT TTAAAAAAGA TTATGCTGCC 
TATGTTAATT TGGCCCGTGA TTTGGCTCAG AACATTGGTG CTGATTTTTC TCTGGACGAT 
GGAACAGGTT ATGGCATAGT CACTCATGAT TGGATTACAA AAAATTGGTG GGGAGATCAT 
ACAGATCCTT ATGGTTATTT AGCGCGTGGG GGATTAGTAA AGCGCATTGG CACNAGATTT 
ACAACGGGCG TTTCNGNAAC AGGTGAGACT GGTCATTATT CAGCCAGGT 



EF071-4 (SEQ ID NO:272) 



F KKLMIQLALV 

IGLSLTIPMT AXAYTIEADP INFTYFPGSA 
NWSNAYVSYF VGSGGRVKQL APAGQIQYGA 
VNLARDLAQN IGADFSLDDG TGYGIVTHDW 
TGVSXTGETG HYSAR 

EF072-1 (SEQ ID NO: 273) 



SNELIVLHES GNERNLGPHS LDNEVAYMKR 
GSLANQKAYA QIELARTNNA ATFKKDYAAY 
ITKNWWGDHT DPYGYLARGG LVKRIGTRFT 



TAATCAATGA AAAACGCACG 
TTTTCACAGC AAGCATTAGC 
TTATTGTTCC CTGATGGTCA 
CTGCTTCAAA ATTATCGGGG 
CCGTTTTATC AGCTTCGTTC 
GAAACCGGTG CAACAAATAG 
GAAGATGGAG TGGTTTCTTT 
TATTTATTTG TTGAAGCGGA 
GTGATTTTGC CTGTTCAAGA 
AAAAATGAAG AAAATGCCTA 
CAAGGCTTTA ATCAAGGAGA 
ATTTTAGGAT ATCAGGAATT 
CCAGAATCAA TTGAGGTAAA 
ACGCAAAAGC ATGGATTTAC 



TTGGTTAAGT ATTTGCGTCA 
AGAGGCATCG CAAGCAAGCG 
ATTACCAGAA CAGCAGCAAA 
CTTAAATGAC GTCACTTATC 
TGAAGGAAAA ACGGTCCAAG 
AAAACCGATC GCAGAAGATA 
TTCATTAGCT AGCAAAGATT 
AGCACCAGAA GTGGTAAAGG 
TCCACAAGGG CAATCGTTAA 
TGACTTACCA CCACTTGAAA 
GCACATTAAC TATCAGTTAA 
CCGTTTGTCA GATAAGGCGG 
AGTGGCTGGA AAAACAGTTA 
GCTTGATTTT TCAATTAAAG 



TGCTACTCGC TCTTTTCGGG 
TTCAAGTTAC GTTGCACAAA 
ACACAGGGGA AGAGGGAACG 
AAGTCTATGA TGTGACGGAT 
AGGCACAGCG TCAATTAGCA 
AAACACAGAC AATAAATGGA 
CGCAGCAACG AGATAAAGCC 
AAAAAGCTAG CAACCTAGTA 
CGCATATTCA TTTATATCCA 
AAACGGTACT CGATAAGCAA 
CGACTCAGAT TCCAGCGAAT 
ATACAACGTT GACACTTTTA 
CTACAGGTTA GACACTGACG 
ACTTACAAAA CTTTGCAAAT 
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CAAACAATGA CTGTGTCGTA TCAAATGCGT TTAGAAAAGA CCGCTGAACC TGACACTGCG 
ATTAACAACG AAGGACAATT AGTCACGGAC AAACATACCT TGACTAAAAG AGCCACAGTT 
CGTACAGGCG GCAAGTCTTT TGTCAAAGTT GATAGTGAAA ATGCGAAAAT CACCTTGCCA 
GAGGCTGTTT TTATCGTCAA AAATCAAGCG GGGGAATACC TCAATGAAAC AGCAAACGGG 
TATCGTTGGC AAAAAGAAAA AGCATTAGCT AAAAAATTCA CGTCTAATCA AGCCGGTGAA 
TTTTCAGTTA AAGGNNTTAA AAGATGGCCA GTACTTCTTG GAAGAAATCT CTGCACCAAA 
AGGTTATCTT CTGAATCAAA CAGAAATTCC TTTTACGGTG GGAAAAAATT CTTATGCAAC 
GAACGGACAA CGAACAGCAC CGTTACATGT AATCAATAA 



EF072-2 {SEQ ID NO:274) 

MKNARWLSI CVMLLALFGF SQQALAEASQ ASVQVTLHKL LFPDGQLPEQ QQNTGEEGTL 
LQNYRGLNDV TYQVYDVTDP FYQLRSEGKT VQEAQRQLAE TGATNRKPIA EDKTQTINGE 
DGWSFSLAS KDSQQRDKAY LFVEAEAPEV VKEKASNLW ILPVQDPQGQ SLTHIHLYPK 
NEENAYDLPP LEKTVLDKQQ GFNQGEHINY QLTTQIPANI LGYQEFRLSD KADTTLTLLP 
ESIEVKVAGK TVTTGYTLTT QKHGFTLDFS IKDLQNFANQ TMTVSYQMRL EKTAEPDTAI 
NNEGQLVTDK HTLTKRATVR TGGKSFVKVD SENAKITLPE AVFIVKNQAG EYLNETANGY 
RWQKEKALAK KFTSNQAGEF SVKGXKRWPV LLGRNLCTKR LSSESNRNSF YGGKKFLCNE 
RTTNSTVTCN Q 



EF072-3 (SEQ ID NO:275) 



ATTACCAGAA CAGCAGCAAA ACACAGGGGA 
CTGCTTCAAA ATTATCGGGG CTTAAATGAC 
CCGTTTTATC AGCTTCGTTC TGAAGGAAAA 
GAAACCGGTG CAACAAATAG AAAACCGATC 
GAAGATGGAG TGGTTTCTTT TTCATTAGCT 
TATTTATTTG TTGAAGCGGA AGCACCAGAA 
GTGATTTTGC CTGTTCAAGA TCCACAAGGG 
AAAAATGAAG AAAATGCCTA TGACTTACCA 
CAAGGCTTTA ATCAAGGAGA GCACATTAAC 
ATTTTAGGAT ATCAGGAATT CCGTTTGTCA 
CCAGAATCAA TTGAGGTAAA AGTGGCTGGA 
ACGCAAAAGC ATGGATTTAC GCTTGATTTT 
CAAACAATGA CTGTGTCGTA TCAAATGCGT 
ATTAACAACG AAGGACAATT AGTCACGGAC 
CGTACAGGCG GCAAGTCTTT TGTCAAAGTT 
GAGGCTGTTT TTATCGTCAA AAATCAAGCG 
TATCGTTGGC AAAAAGAAAA AGCATTAGCT 
TTTTCAGTTA AAGGNNTTAA AAGATGGCCA 
AGGTTATCTT CTGAATCAAA CAGAAATTCC 
GAACGGACAA CGAACAGCAC CGTTACATGT 

EF072-4 (SEQ ID NO:276) 



AGAGGGAACG 

GTCACTTATC AAGTCTATGA TGTGACGGAT 
ACGGTCCAAG AGGCACAGCG TCAATTAGCA 
GCAGAAGATA AAACACAGAC AATAAATGGA 
AGCAAAGATT CGCAGCAACG AGATAAAGCC 
GTGGTAAAGG AAAAAGCTAG CAACCTAGTA 
CAATCGTTAA CGCATATTCA TTTATATCCA 
CCACTTGAAA AAACGGTACT CGATAAGCAA 
TATCAGTTAA CGACTCAGAT TCCAGCGAAT 
GATAAGGCGG ATACAACGTT GACACTTTTA 
AAAACAGTTA CTACAGGTTA CACACTGACG 
TCAATTAAAG ACTTACAAAA CTTTGCAAAT 
TTAGAAAAGA CCGCTGAACC TGACACTGCG 
AAACATACCT TGACTAAAAG AGCCACAGTT 
GATAGTGAAA ATGCGAAAAT CACCTTGCCA 
GGGGAATACC TCAATGAAAC AGCAAACGGG 
AAAAAATTCA CGTCTAATCA AGCCGGTGAA 
GTACTTCTTG GAAGAAATCT CTGCACCAAA 
TTTTACGGTG GGAAAAAATT CTTATGCAAC 
A 



QLPEQ QQNTGEEGTL 
LQNYRGLNDV TYQVYDVTDP 
DGWSFSLAS KDSQQRDKAY 
NEENAYDLPP LEKTVLDKQQ 
ESIEVKVAGK TVTTGYTLTT 
NNEGQLVTDK HTLTKRATVR 
RWQKEKALAK KFTSNQAGEF 
RTTNSTVTC 



FYQLRSEGKT VQEAQRQLAE 
LFVEAEAPEV VKEKASNLW 
GFNQGEHINY QLTTQIPANI 
QKHGFTLDFS IKDLQNFANQ 
TGGKSFVKVD SENAKITLPE 
SVKGXKRWPV LLGRNLCTKR 



TGATNRKPIA EDKTQTINGE 
ILPVQDPQGQ SLTHIHLYPK 
LGYQEFRLSD KADTTLTLLP 
TMTVSYQMRL EKTAEPDTAI 
AVFIVKNQAG EYLNETANGY 
LSSESNRNSF YGGKKFLCNE 
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EF073-1 (SEQ ID NO: 277) 



TAAATGAACA AATTAAATAC AAAATTACTG 
ATTGCTGTCG CTAGAGAATA TGGCTTCTTC 
TTCGTTCTCT ATCGAAAAAA GAAAAATGCC 
ACGAAAGATA AAGAAGCCCA TTATCGTGAG 
TTCAGAAGTA CAATGAGCAC AGCCAAAAAA 
CGTTCAACTA AATTACGGGC GATTGACTTA 
CTGTTTAAAG AGTTAGTGAA AGAACCTAAA 
ACACATTTAC CAAATATCGT TGACTTAACA 
GTAAAAAACA AACAAACGTA TGAAAAATTA 
TCAAAATTAG TTAAAAATGA TTATGAGGAA 
GTCGAAATGT CGATCGCTAA AAGCAGCTTG 
CAAGTAAACG AAGACCAGCA ATAA 



ATTGGCTATA TTCTTTTAGG AGCCTTAATC 
GCTTTTGTGA TTCTGGTAGG CTTTTTAGTA 
GCCGACAAAA GCGATCAAAT GCCTTACTTA 
TTGGGGTTAT CTCCACAAGA AATTGATTTT 
CAAATCATAC AATTGCAAGA AAACATGAAT 
CGTAATGATA CTACGAAGGT TTCTAAAGCT 
AAGTTACACT TAGCCAATCA CTTTCTCTAT 
AGTAAACATT TAGAAATCGA ACAACACGAA 
GAAGAAAGCG CACAAATCAT TGACCAATTG 
ATCGTTTCCG ATGACTTAGA CGATTTAGAT 
TCGCAAAAAG CTGCAACTGA GGAATCACCT 



EF073-2 (SEQ ID NO:278) 

MNKLNTKLLI GYILLGALII AVAREYGFFA 
KDKEAHYREL GLSPQEIDFF RSTMSTAKKQ 
FKELVKEPKK LHLANHFLYT HLPNIVDLTS 
KLVKNDYEEI VSDDLDDLDV EMSIAKSSLS 

EF073-3 (SEQ ID NO:279) 



FVILVGFLVF VLYRKKKNAA DKSDQMPYLT 
IIQLQENMNR STKLRAIDLR NDTTKVSKAL 
KHLEIEQHEV KNKQTYEKLE ESAQIIDQLS 
QKAATEESPQ VNEDQQ 



CT ATCGAAAAAA GAAAAATGCC GCCGACAAAA GCGATCAAAT GCCTTACTTA 
ACGAAAGATA AAGAAGCCCA TTATCGTGAG TTGGGGTTAT CTCCACAAGA AATTGATTTT 
TTCAGAAGTA CAATGAGCAC AGCCAAAAAA CAAATCATAC AATTGCAAGA AAACATGAAT 
CGTTCAACTA AATTACGGGC GATTGACTTA CGTAATGATA CTACGAAGGT TTCTAAAGCT 
CTGTTTAAAG AGTTAGTGAA AGAACCTAAA AAGTTACACT TAGCCAATCA CTTTCTCTAT 
ACACATTTAC CAAATATCGT TGACTTAACA AGTAAACATT TAGAAATCGA ACAACACGAA 
GTAAAAAACA AACAAACGTA TGAAAAATTA GAAGAAAGCG CACAAATCAT TGACCAATTG 
TCAAAATTAG TTAAAAATGA TTATGAGGAA ATCGTTTCCG ATGACTTAGA CGATTTAGAT 
GTCGAAATGT CGATCGCTAA AAGCAGCTTG TCGCAAAAAG CTGCAACTGA GGAATCACCT 
CAAGTAAACG AAGACCAGCA AT 



EF073-4 (SEQ ID NO: 280) 



YRKKKNAA DKSDQMPYLT 

KDKEAHYREL GLSPQEIDFF RSTMSTAKKQ IIQLQENMNR STKLRAIDLR NDTTKVSKAL 
FKELVKEPKK LHLANHFLYT HLPNIVDLTS KHLEIEQHEV KNKQTYEKLE ESAQIIDQLS 
KLVKNDYEEI VSDDLDDLDV EMSIAKSSLS QKAATEESPQ VNEDQQ 



EF074-1 (SEQ ID NO:281) 

TAAAGGAGTT CTCAAAAAAT GAAGCTAAAA 
ACCGTTGCAG TTGGCTTGTG GTTAACGCCT 
ATGGTAGATA TCTCTGGCAA AAAAGTGTTG 
GGACGCGATG GTTACAAACA AGGAACATCA 
GCCTACAATG TCGTACCGGT TTCCTTCATG 
TTCAAGCCTT ATAACCAAAC GGACACTGCC 
CAAGGTCGCG CAGTTTTATT GGCACTTGGT 
GGCGATGAAC AAGCCTTTGC GAATGAAATC 
GGTTTAGACA TCGACTTAGA GCAATTGGCG 



AAAATAATTC CTGCTTTTCC CCTTCTTTCA 
ACTCAAGCTT CTGCAGATGC TGCGGATACG 
GTTGGATATT GGCATAACTG GGCCTCAAAA 
GCATCACTAA ACCTTTCAGA AGTAAATCAA 
AAAAGCGATG GCACGACACG GATTCCTACG 
TTCCGACAAG AAGTCGCACA ATTAAATAGT 
GGAGCAGATG CACATATTCA ATTAGTCAAA 
ATTCGTCAAG TGGAAACATA CGGCTTTGAT 
ATTACTGCTG GCGACAACCA AACCGTCATC 
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CCTGCTACGT TGAAAATAGT CAAAGACCAT TATCGAGCAC AAGGAAAAAA TTTCATCATT 
ACGATGGCAC CAGAATTCCC TTATTTAAAA CCTGGTGCCG CTTATGAAAC ATACATTACT 
TCCCTAAATG GTTATTATGA TTACATTGCC CCACAATTAT ATAACCAAGG GGGCGACGGT 
GTCTGGGTTG ATGAAGTTAT GACTTGGGTT GCTCAAAGCA ACGATGCTCT AAAATACGAG 
TTCCTCTATN ATATT 

EF074-2 (SEQ ID NO:282) 

MKLKK IIPAFPLLST VAVGLWLTPT QASADAADTM VDISGKKVLV GYWHNWASKG 
RDGYKQGTSA SLNLSEVNQA YN\A^VSFMK SDGTTRIPTF KPYNQTDTAF RQEVAQLNSQ 
GRAVLLALGG ADAHIQLVKG DEQAFANEII RQVETYGFDG LDIDLEQLAI TAGDNQTVIP 
ATLKIVKDHY RAQGKNFIIT MAPEFPYLKP GAAYETYITS LNGYYDYIAP QLYNQGGDGV 
WVDEVMTWVA QSNDALKYEF LYXI 

EF074-3 (SEQ ID NO:283) 

TGC TGCGGATACG 

ATGGTAGATA TCTCTGGCAA AAAAGTGTTG GTTGGATATT GGCATAACTG GGCCTCAAAA 
GGACGCGATG GTTACAAACA AGGAACATCA GCATCACTAA ACCTTTCAGA AGTA7VATCAA 
GCCTACAATG TCGTACCGGT TTCCTTCATG AAAAGCGATG GCACGACACG GATTCCTACG 
TTCAAGCCTT ATAACCAAAC GGACACTGCC TTCCGACAAG AAGTCGCACA ATTAAATAGT 
CAAGGTCGCG CAGTTTTATT GGCACTTGGT GGAGCAGATG CACATATTCA ATTAGTCAAA 
GGCGATGAAC AAGCCTTTGC GAATGAAATC ATTCGTCAAG TGGAAACATA CGGCTTTGAT 
GGTTTAGACA TCGACTTAGA GCAATTGGCG ATTACTGCTG GCGACAACCA AACCGTCATC 
CCTGCTACGT TGAAAATAGT CAAAGACCAT TATCGAGCAC AAGGAAAAAA TTTCATCATT 
ACGATGGCAC CAGAATTCCC TTATTTAAAA CCTGGTGCCG CTTATGAAAC ATACATTACT 
TCCCTAAATG GTTATTATGA TTACATTGCC CCACAATTAT ATAACCAAGG CGGCGACGGT 
GTCTGGGTTG ATGAAGTTAT GACTTGGGTT GCTCAAAGCA ACGATGCTCT AAAATACGAG 
TTCCTCT 



EF074-4 (SEQ ID NO: 284) 
AADTM VDISGKKVLV GYWHNWASKG 
RDGYKQGTSA SLNLSEVNQA YNWPVSFMK 
GRAVLLALGG ADAHIQLVKG DEQAFANEII 
ATLKIVKDHY RAQGKNFIIT MAPEFPYLKP 
WVDEVMTWVA QSNDALKYEF LY 



SDGTTRIPTF KPYNQTDTAF RQEVAQLNSQ 
RQVETYGFDG LDIDLEQLAI TAGDNQTVIP 
GAAYETYITS LNGYYDYIAP QLYNQGGDGV 



EF075-1 (SEQ ID NO:285) 

TAACCTATAA GAAAAAAATC ACAACCTGTG ATAAATTATT GGAGGNAAAA TATGTCAAAA 
GGGAAGAAAA TTTTTGCCAT TATCNTTGGA ATTATCTTGG NTCTATTTCT TGCAGTTGTT 
GGAATGGGAG CAAAACTTTA TTGGGATGTT TCTAAATCAA TGGATAAAAC CTATGAAACA 
GTAGAACGAT CTAAAAAAAG TCAGGTCAAT TTAAACAATA AGGAGCCTTT TTCTGTTTTA 
TTATTAGGGA TTGATACAGG CGATGATGGG CGTGTCGAGC AAGGTC-GTTC GGATACAACA 
ATTGTTGCAA CAGTTAATCC TCGTGACAAG CAAACAACCT TAGTCAGTCT TGCTCGC<3AT 
ACCTATGTTG ATATTCCAGG TCAAGGAAAA CAAGATAAAT TGAATCACGC CTATGCTTTT 
GGTGGCGCAT CTTTAGCAAT GGACACAGTT GAAAACTATT TAAACATACC TATTAATCAT 
TATGTTTCAA TTAATATGGC TGGTTTAAAA GAATTAGTCA ACGCGGTTGG CGGAATCGAA 
GTGAACAATA ATCTGACTTT TTCTCAAGAC GGATATGATT TTACGATTGG TAAAATTTCA 
TTGGATGGTG AACAAGCACT CTCCTATTCA AGAATGCGTT ACGAAGACCC TAATGGTGAC 
TACGGCCGCC AAGAACGTCA AAGAAAAGTG ATTGAAGGCA TCGTCCAAAA AGTCTTAAGT 
CTTAACAGCG TAAGCAACTA TCAAGAAATT TTAACAGCTG TTTCTGATAA TATGAAGACA 
GATTTAAGTT TTGATGACAT GAAAAAAATT GCCTTAGATT ATCGCAGTGC CTTTGGTAAA 
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GTGAAACAAG ACCAACTTCA AGGTACTGGT TTTATGCAAG ATGGTGTTTC CTATCAACGT 
GTGGATGAAC AAGAATTAAC TCGTGTCCAA CAAGAGTTGA AAAATCAATT GAATACAAAA 
TAA 

EF075-2 (SEQ ID NO:286) 

MSKG KKIFAIIXGI ILXLFLAWG MGAKLYWDVS KSMDKTYETV 
ERSKKSQVNL NNKEPFSVLL LGIDTGDDGR VEQGRSDTTI VATVNPRDKQ 
YVDIPGQGKQ DKLNHAYAFG GASLAMDTVE NYLNIPINHY VSINMAGLKE 
NNNLTFSQDG YDFTIGKISL DGEQALSYSR MRYEDPNGDY GRQERQRKVI 
NSVSNYQEIL TAVSDNMKTD LSFDDMKKIA LDYRSAFGKV KQDQLQGTGF 
DEQELTRVQQ ELKNQLNTK 



EF075-3 (SEQ ID NO: 287) 

ACTTTA TTGGGATGTT TCTAAATCAA TGGATAAAAC CTATGAAACA 
GTAGAACGAT CTAAAAAAAG TCAGGTCAAT TTAAACAATA AGGAGCCTTT TTCTGTTTTA 
TTATTAGGGA TTGATACAGG CGATGATGGG CGTGTCGAGC AAGGTCGTTC GGATACAACA 
ATTGTTGCAA CAGTTAATCC TCGTGACAAG CAAACAACCT TAGTCAGTCT TGCTCGCGAT 
ACCTATGTTG ATATTCCAGG TCAAGGAAAA CAAGATAAAT TGAATCACGC CTATGCTTTT 
GGTGGCGCAT CTTTAGCAAT GGACACAGTT GAAAACTATT TAAACATACC TATTAATCAT 
TATGTTTCAA TTAATATGGC TGGTTTAAAA GAATTAGTCA ACGCGGTTGG CGGAATCGAA 
GTGAACAATA ATCTGACTTT TTCTCAAGAC GGATATGATT TTACGATTGG TAAAATTTCA 
TTGGATGGTG AACAAGCACT CTCCTATTCA AGAATGCGTT ACGAAGACCC TAATGGTGAC 
TACGGCCGCC AAGAACGTCA AAGAAAAGTG ATTGAAGGCA TCGTCCAAAA AGTCTTAAGT 
CTTAACAGCG TAAGCAACTA TCAAGAAATT TTAACAGCTG TTTCTGATAA TATGAAGACA 
GATTTAAGTT TTGATGACAT GAAAAAAATT GCCTTAGATT ATCGCAGTGC CTTTGGTAAA 
GTGAAACAAG ACCAACTTCA AGGTACTGGT TTTATGCAAG ATGGTGTTTC CTATCAACGT 
GTGGATGAAC AAGAATTAAC TCGTGTCCAA CAAGAGTTGA AAAATCAATT GAATACAAAA 



EF075-4 (SEQ ID NO:288) 

KLYWDVS KSMDKTYETV 
ERSKKSQVNL NNKEPFSVLL LGIDTGDDGR VEQGRSDTTI VATVNPRDKQ TTLVSLARDT 
YVDIPGQGKQ DKLNHAYAFG GASLAMDTVE NYLNIPINHY VSINMAGLKE LVNAVGGIEV 
NNNLTFSQDG YDFTIGKISL DGEQALSYSR MRYEDPNGDY GRQERQRKVI EGIVQKVLSL 
NSVSNYQEIL TAVSDNMKTD LSFDDMKKIA LDYRSAFGKV KQDQLQGTGF MQDGVSYQRV 
DEQELTRVQQ ELKNQLNTK 



TTLVSLARDT 
LVNAVGGIEV 
EGIVQKVLSL 
MQDGVSYQRV 



EF076-1 (SEQ ID NO:289) 

TAGAAAATAA CAGAGGAGCT GAAGGAAATG 
AGCATTGCTG CAGTTGCAAG TGTCTCTGTT 
AAGGTATCTC ATGTTTCCAA TCGTTATAAA 
GGAAACCAAA AATTATTATC GATTGTCGAT 
TTAAATGTTG TGGATCGTGT GAAAGATGGC 
GTTAAAGACA ATACAGATTC TTTAAAAGAA 
AAGTTAAAAA AGTGGCCTAG GCCATCTTTT 
TAA 



AAAGCATCAA CAAAAATTGG TATCGGTTTA 
GCAGTCATCG CTTCTGAAAA AATTATTAAG 
GTTAAAAAGT TTGTAGACGA TAAATTTGAT 
GATTTATCCG ATGATGAATT AGATTCTGTT 
GGTTCAAAAT TAGCTGAATA TGGCGAAAAA 
CGCTTTTTCA CATTTATTGA AGATGCAATG 
TTTTATAAAA ATAATTCTTT TGTTTCAACA 
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EF076-2 (SEQ ID NO:290) 

MK ASTKIGIGLS lAAVASVSVA VIASEKIIKK VSHVSNRYKV KKFVDDKFDG 
NQKLLSIVDD LSDDELDSVL NWDRVKDGG SKLAEYGEKV KDNTDSLKER FFTFIEDAMK 
LKKWPRPSFF YKNNSFVST 

EF076-3 {SEQ ID NO:291) 



CATCG CTTCTGAAAA AATTATTAAG 
AAGGTATCTC ATGTTTCCAA TCGTTATAAA 
GGAAACCAAA AATTATTATC GATTGTCGAT 
TTAAATGTTG TGGATCGTGT GAAAGATGGC 
GTTAAAGACA ATACAGATTC TTTAAAAGAA 
AAGTTAAAAA AGTGGCCTAG GCCATCTTTT 



GTTAAAAAGT TTGTAGACGA TAAATTTGAT 
GATTTATCCG ATGATGAATT AGATTCTGTT 
GGTTCAAAAT TAGCTGAATA TGGCGAAAAA 
CGCTTTTTCA CATTTATTGA AGATGCAATG 
TTTTATAAAA ATAATTCTT 



EF076-4 (SEQ ID NO:292) 



VIASEKIIKK VSHVSNRYKV KKFVDDKFDG 

NQKLLSIVDD LSDDELDSVL NWDRVKDGG SKLAEYGEKV KDNTDSLKER FFTFIEDAMK 
LKKWPRPSFF YKNNS 



EF077-1 (SEQ ID NO: 293) 



TAATGTAAAG TGAATGATGG 
ACAATTATAA CAGGAGTTTT 
GCGTATGGCA TTATTTTAAT 
ATTCAAACCT TACGT6AAGG 
GCAACCTTAG CTGTGGGAGA 
GGTGATTCAT TAGAAGACTA 
GATAACTCGC CACAAAAAGC 
GAGGAAATCA ATGTTGGCGA 
GGCTTGGTAA AAACCGGGAC 
CCAATTGAAA AAAATCCTGG 
TTGAAAATGG TTGCTGAAAA 
GTGAAAGAAT CTGCGGCGCG 
CCTTTTACAC TAGTTGCCTA 
ACACGTTTTG CGGAAGTCTT 
ATTGCTTTAG TGGCAGGGAT 
ACGATGGTCG AAAAATTAGC 
ACGCAAGGAC AACTTTCTGT 
GAATTAGTGG GATTGGCAGC 
ATTGTTGCTT ATGCCAGAAA 
GTTTCTGGTG CTGGCGTGAA 
AATTTTGTGA CACAAGAGTC 
TCACGTAATG GCACATATTT 
AAAGAGACTA TGGAAAAATT 
GATCAAGAAT CCGTTGCAGA 
GAATGTTTAC CACAAGATAA 
GTCATCATGG TAGGAGATGG 
ATTGCTATGG GTGCTCATGG 
AAAGATGACT TAAGTAAAGT 
GCCAAACAAT CTGTATTAAT 
ACCGGGATCA TTCCGGCGCT 
ATCTTATCTG CTTTGCGTGC 



GAGAGAAAAA GAGATGAAGC 
GGCATTATTA TTTGAATTTA 
AACAGGTTCT GTAATGGCGT 
AAAATATGGT GTCGATATTT 
ATACTGGGCC AGTTTGATGA 
TGCCGCTGGA AAAGCTAACC 
TCATCGCTTG AATGGCGAAA 
TGAATTAGTA GTAAAACCAG 
ATCAACAGTC GATGAATCTT 
GGATGAATTA ATGTCGGGTT 
AACTGTAGCA GACAGTCAAT 
TCCAGCTCAT TTTGTACGTT 
CCTAATTGCA GGTGTTGCTT 
AGTTGTTGCT TCGCCGTGTC 
GGGTCGTTCA AGTCGTCATG 
TTCTGCAAAA ACGATTGCGT 
TGATCAAGTC CAACCAATCA 
AAGCGTGGAA CAAGAATCAA 
GCAAGATGTC CCATTAAAAA 
GGCATTTGTG GATGGTGCTG 
TCAAGAAACT GAAAAAATTG 
AGGCCGAATT ACTTTTACAG 
ACACCAATTA CATCTTCAAC 
AACGATTGCT GCAGAAGTAG 
ATTAACTATT CTAAAAGAAT 
TGTAAATGAT GCACCTTCGC 
AGCTACTGCG GCTAGTGAAA 
CAGCCAAGCG GTCGAAATTG 
CGGAATTTTT ATCTGCGTTT 
AATCGGGGCT ATGCTACAAG 
TCGTCGAATT GGCCAGTAA 



ATGTAACAAA ATTGGGGATT 
TTTTACATCA GCCGAATTGG 
TAATGATGTT CTGGGAAATG 
TAGCGATTAC CGCTATCGTT 
TTTTAATTAT GTTGACTGGT 
AAGAGCTGAA GTCATTATTG 
ATTTAGAAGA TGTTTCTGTT 
GGGAACTAGT TCCAGTTGAT 
CATTAACAGG AGAATCAAAA 
CCGTGAATGG TGACGGCTCT 
ATCAAACAAT TGTGAACTTA 
TAGCAGATCG CTATGCGGTA 
GGTTTGTTTC AAAAAGTCCG 
CTTTAATTCT ATCTGGCCCA 
GGGTCGTTAT TAAATCGGGA 
TTGATAAAAC AGGCACGATT 
ATGCTGGAAT AACTGCTGCT 
GTCATATTTT AGCTAGATCA 
ATATTACAGA TCTAGCGGAA 
AGATACGGGT AGGTAAAAAG 
ATAAAACGAC TATTCATATT 
ACACTGTACG CCCAGAAGCA 
GAATTTTAAT GCTGACGGGG 
GAATTACCGA AGTACATGGG 
TGCCTAAAGA AAATCATCCA 
TTGCTGCTGC AGACGTAGGT 
CTGCTGACGT TGTTATTTTA 
CCCAAGATAC CATGAAAATT 
TACTAATGTT AATTGCTAGT 
AAGTCGTGGA CACTGTGTCA 
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EF077-2 (SEQ ID NO:294) 

MKHVTKLGIT IITGVLALLF EFILHQPNWA YGIILITGSV MALMMFWEMI 
QTLREGKYGV DILAITAIVA TLAVGEYWAS LMILIMLTGG DSLEDYAAGK ANQELKSLLD 
NSPQKAHRLN GENLEDVSVE EINVGDELW KPGELVPVDG LVKTGTSTVD ESSLTGESKP 
lEKNPGDELM SGSVNGDGSL KMVAEKTVAD SQYQTIVNLV KESAARPAHF VRLADRYAVP 
FTLVAYLIAG VAWFVSKSPT RFAEVLWAS PCPLILSAPI ALVAGMGRSS RHGWIKSGT 
MVEKLASAKT lAFDKTGTIT QGQLSVDQVQ PINAGITAAE LVGLAASVEQ ESSHILARSI 
VAYARKQDVP LKNITDLAEV SGAGVKAFVD GAEIRVGKKN FVTQESQETE KIDKTTIHIS 
RNGTYLGRIT FTDTVRPEAK ETMEKLHQLH LQRILMLTGD QESVAETIAA EVGITEVHGE 
CLPQDKLTIL KELPKENHPV IMVGDGVNDA PSLAAADVGI AMGAHGATAA SETADWILK 
DDLSKVSQAV EIAQDTMKIA KQSVLIGIFI CVLLMLIAST GIIPALIGAM LQEWDTVSI 
LSALRARRIG Q 

EF077-3 (SEQ ID NO:295) 

TCA GCCGAATTGG 

GCGTATGGCA TTATTTTAAT AACAGGTTCT GTAATGGCGT TAATGATGTT CTGGGAAATG 
ATTCAAACCT TACGTGAAGG AAAATATGGT GTCGATATTT TAGCGATTAC CGCTATCGTT 
GCAACCTTAG CTGTGGGAGA ATACTGGGCC AGTTTGATGA TTTTAATTAT GTTGACTGGT 
GGTGATTCAT TAGAAGACTA TGCCGCTGGA AAAGCTAACC AAGAGCTGAA GTCATTATTG 
GATAACTCGC CACAAAAAGC TCATCGCTTG AATGGCGAAA ATTTAGAAGA TGTTTCTGTT 
GAGGAAATCA ATGTTGGCGA TGAATTAGTA GTAAAACCAG GGGAACTAGT TCCAGTTGAT 
GGCTTGGTAA AAACCGGGAC ATCAACAGTC GATGAATCTT CATTAACAGG AGAATCAAAA 
CCAATTGAAA AAAATCCTGG GGATGAATTA ATGTCGGGTT CCGTGAATGG TGACGGCTCT 
TTGAAAATGG TTGCTGAAAA AACTGTAGCA GACAGTCAAT ATCAAACAAT TGTGAACTTA 
GTGAAAGAAT CTGCGGCGCG TCCAGCTCAT TTTGTACGTT TAGCAGATCG CTATGCGGTA 
CCTTTTACAC TAGTTGCCTA CCTAATTGCA GGTGTTGCTT GGTTTGTTTC AAAAAGTCCG 
ACACGTTTTG CGGAAGTCTT AGTTGTTGCT TCGCCGTGTC CTTTAATTCT ATCTGCCCCA 
■ATTGCTTTAG TGGCAGGGAT GGGTCGTTCA AGTCGTCATG GGGTCGTTAT TAAATCGGGA 
ACGATGGTCG AAAAATTAGC TTCTGCAAAA ACGATTGCGT TTGATAAAAC AGGCACGATT 
ACGCAAGGAC AACTTTCTGT TGATCAAGTC CAACCAATCA ATGCTGGAAT AACTGCTGCT 
GAATTAGTGG GATTGGCAGC AAGCGTGGAA CAAGAATCAA GTCATATTTT AGCTAGATCA 
ATTCTTGCTT ATGCCAGAAA GCAAGATGTC CCATTAAAAA ATATTACAGA TCTAGCGGAA 
GTTTCTGGTG CTGGCGTGAA GGCATTTGTG GATGGTGCTG AGATACGGGT AGGTAAAAAG 
AATTTTGTGA CACAAGAGTC TCAAGAAACT GAAAAAATTG ATAAAACGAC TATTCATATT 
TCACGTAATG GCACATATTT AGGCCGAATT ACTTTTACAG ACACTGTACG CCCAGAAGCA 
AAAGAGACTA TGGAAAAATT ACACCAATTA CATCTTCAAC GAATTTTAAT GCTGACGGGG 
GATCAAGAAT CCGTTGCAGA AACGATTGCT GCAGAAGTAG GAATTACCGA AGTACATGGG 
GAATGTTTAC CACAAGATAA ATTAACTATT CTAAAAGAAT TGCCTAAAGA AAATCATCCA 
GTCATCATGG TAGGAGATGG TGTAAATGAT GCACCTTCGC TTGCTGCTGC AGACGTAGGT 
ATTGCTATGG GTGCTCATGG AGCTACTGCG GCTAGTGAAA CTGCTGACGT TGTTATTTTA 
AAAGATGACT TAAGTAAAGT CAGCCAAGCG GTCGAAATTG CCCAAGATAC CATGAAAATT 
GCCAAACAAT CTGTATTAAT CGGAATTTTT ATCTGCGTTT TACTAATGTT AATTGCTAGT 
ACCGGGATCA TTCCGGCGCT AATCGGGGCT ATGCTACAAG AAGTCGTGGA CACTGTGTCA 
ATCTTATCTG CTTTGCGTGC TCGTCGAATT GGCC 

EF077-4 (SEQ ID NO:296) 

QPNWA YGIILITGSV MALMblFWEMI 

QTLREGKYGV DILAITAIVA TLAVGEYWAS LMILIMLTGG DSLEDYAAGK ANQELKSLLD 
NSPQKAHRLN GENLEDVSVE EINVGDELW KPGELVPVDG LVKTGTSTVD ESSLTGESKP 
lEKNPGDELM SGSVNGDGSL KMVAEKTVAD SQYQTIVNLV KESAARPAHF VRLADRYAVP 
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FTLVAYLIAG VAWFVSKSPT RFAEVLWAS 
MVEKLASAKT lAFDKTGTIT QGQLSVDQVQ 
VAYARKQDVP LKNITDLAEV SGAGVKAFVD 
RNGTYLGRIT FTDTVRPEAK ETMEKLHQLH 
CLPQDKLTIL KELPKENHPV IMVGDGVNDA 
DDLSKVSQAV EIAQDTMKIA KQSVLIGIFI 
LSALRARRIG 



PCPLILSAPI ALVAGMGRSS RHGWIKSGT 
PINAGITAAE LVGLAASVEQ ESSHILARSI 
GAEIRVGKKN FVTQESQETE KIDKTTIHIS 
LQRILMLTGD QESVAETIAA EVGITEVHGE 
PSLAAADVGI AMGAHGATAA SETADWILK 
CVLLMLIAST GIIPALIGAM LQEWDTVSI 



EF079-1 (SEQ ID NO:297) 

TAATTTCTAG CATCACCGAA GAAATTTTTA 
CCCAGGCTCT CATGCTTTAT TTTTAAGGAG 
ATCATTGATG GTTTTATGAT TCTTTTACTG 
TTTGTTAGCG ATGCATTAAA TAACTATCTG 
AAAGCAAGCC AAGAAAACAC CAAAGAAATG 
AACCAAGAAT TAGCGAAAAA AGGCAGCAAT 
AAAACAACGA AAAAACCAGA CAAATCCTAT 
ATTCCAAAAA TAAATGTCCG TTTACCAATT 
AAAGGAAGCT CCTTGTTAGA AGGAACCTCC 
GTCATTTCAG GCCATCGTGG TCTCCCTCAA 
AAAAAAGGCG ATGAATTTTA TATCGAAGTC 
CAAATAAAAA CCGTTGAACC AACTGATACA 
CTCGTCACTT TATTAACTTG CACACCGTAT 
GGACATCGTA TCCCATATCA ACCAGAAAAA 
CAACAAAATT TACTATTATG GACATTACTT 
TTCATTATCT GGTACAAGCG ACGGAAAAAG 



GAAAAACAAA GAGCCTGGGC CAATCACTGT 
GAAGCAATGA AGTCAAAAAA GAAACGTCGT 
ATTATTGGAA TAGGTGCATT TGCGTATCCT 
GATCAACAAA TTATCGCTCA TTATCAAGCA 
GCTGAACTTC AAGAAAAAAT GGAAAAGAAA 
CCTGGATTAG ATCCTTTTTC TGAAACGCAA 
TTTGAAAGTC ATACGATTGG TGTTTTAACC 
TTTGATAAAA CGAATGCATT GCTATTGGAA 
TATCCTACAG GTGGTACGAA TACACATGCG 
GCCAAATTAT TTACAGATTT GCCAGAATTA 
AATGGGAAGA CGCTTGCTTA TCAAGTAGAT 
AAAGATTTAC ACATTGAGTC TGGCCAAGAT 
ATGATAAACA GTCATCGGTT ATTAGTTCGA 
GCAGCAGCGG GGATGAAAAA AGTGGCACAA 
TTAATTGCCT GTGCGTTAAT TATTAGCGGC 
ACGACCAGAA AACCAAAGTA G 



EF079-2 (SEQ ID NO:298) 



MKSKKKRRI IDGFMJ^LLI IGIGAFAYPF 
VSDALNNYLD QQIIAHYQAK ASQENTKEMA 
TTKKPDKSYP ESHTIGVLTI PKINVRLPIF 
ISGHRGLPQA KLFTDLPELK KGDEFYIEVN 
VTLLTCTPYM INSHRLLVRG HRIPYQPEKA 
IIWYKRRKKT TRKPK 



ELQEKMEKKN QELAKKGSNP GLDPFSETQK 
DKTNALLLEK GSSLLEGTSY PTGGTNTHAV 
GKTLAYQVDQ IKTVEPTDTK DLHIESGQDL 
AAGMKKVAQQ QNLLLWTLLL lACALIISGF 



EF079-3 (SEQ ID NO:299) 
TCCT 

TTTGTTAGCG ATGCATTAAA TAACTATCTG 
AAAGCAAGCC AAGAAAACAC CAAAGAAATG 
AACCAAGAAT TAGCGAAAAA AGGCAGCAAT 
AAAACAACGA AAAAACCAGA CAAATCCTAT 
ATTCCAAAAA TAAATGTCCG TTTACCAATT 
AAAGGAAGCT CCTTGTTAGA AGGAACCTCC 
GTCATTTCAG GCCATCGTGG TCTCCCTCAA 
AAAAAAGGCG ATGAATTTTA TATCGAAGTC 
CAAATAAAAA CCGTTGAACC AACTGATACA 
CTCGTCACTT TATTAACTTG CACACCGTAT 
GGACATCGTA TCCCATATCA ACCAGAAAAA 
CAACAAAATT TACTATTATG GACATTACTT 
TTCATTATCT GGTACAAGCG ACGGAAAAAG 



GATCAACAAA TTATCGCTCA TTATCAAGCA 
GCTGAACTTC AAGAAAAAAT GGAAAAGAAA 
CCTGGATTAG ATCCTTTTTC TGAAACGCAA 
TTTGAAAGTC ATACGATTGG TGTTTTAACC 
TTTGATAAAA CGAATGCATT GCTATTGGAA 
TATCCTACAG GTGGTACGAA TACACATGCG 
GCCAAATTAT TTACAGATTT GCCAGAATTA 
AATGGGAAGA CGCTTGCTTA TCAAGTAGAT 
AAAGATTTAC ACATTGAGTC TGGCCAAGAT 
ATGATAAACA GTCATCGGTT ATTAGTTCGA 
GCAGCAGCGG GGATGAAAAA AGTGGCACAA 
TTAATTGCCT GTGCGTTAAT TATTAGCGGC 
ACGACCAGAA AACCAA 



EF079-4 (SEQ ID NO:300) 
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PF 

VSDALNNYLD QQIIAHYQAK ASQENTKEMA ELQEKMEKKN QELAKKGSNP GLDPFSETQK 
TTKKPDKSYF ESHTIGVLTI PKINVRLPIF DKTNALLLEK GSSLLEGTSY PTGGTNTHAV 
ISGHRGLPQA KLFTDLPELK KGDEFYIEVN GKTLAYQVDQ IKTVEPTDTK DLHIESGQDL 
VTLLTCTPYM INSHRLLVRG HRIPYQPEKA AAGMKKVAQQ QNLLLWTLLL lACALIISGF 
IIWYKRRKKT TRKP 

EF080-1 (SEQ ID NO:301) 

TAGTTACACT CGrPTAGGGC TAGCAACGTT AGGCATTTTC GCTGGACTCT TAGCACTCTT 
TTTATTAGGA GGTTATTTCC TATGAAAAAA CGACTTTTAC CTATTTTTTT CCTAATACTT 
CTTACCTTTC GCCTTCCCCT ACCCGTTTCG GCGGCTGAAA ATTCAATTGA TGATGGCGCA 
CAATTACTGA CACCTGATCA AATCAACCAA CTAAAGCAA6 AGATACAACC TTTAGAAGAA 
AAAACAAAAG CCTCTOTCTT TATTCTAACC ACAAATAATA ATACCTATGG CGATGAACAA 
GAATATOCAG ATCATTATCT TTTAAATAAA GTTGGCAAGG ACCAAAATGC GATTCTTTTT 
CTCATTCATA TGGACTTACG GAAAATCTAC ATCTCTACTT CTGGAAACAT GATTGATTAT 
ATCACAGATG CACGAATTCA TGATACCTTA GATAAAATAT GGGATAATAT GAGTCAAGGA 
AATTATTTCG CGGCTGCTCA AACCTTTCTT CAGGAAACTC AAGCATTTGT TAATAAAGGG 
GTTCCTGGGG GGCACTATCG TCTGGACAGC GAAACAGGTA AAATCACTCG TTATAAAGTC 
ATTACCCCGC TCGAAATGGT AATTCCTTTT GCTGCTGCGC TGATACTCAG TTTGGTCTTC 
TTAGGCATTA ATATTTCTAA ATATCAATTA AAATTTTCAA GTTATCAATA TCCCTTTAGG 
GAAAAAACAA CTTTAAACTT AACCTCCCGC ACAGATCAGT TAACCAACTC TTTCATCACT 
ACGCGTCGTA TTCCTAAAAA CAATCGCGGC AGTGGCGGAA TGGGCGGTGG TGGTAGCACC 
ACCCACTCAA CTGGCGGCGG CACATTCGGT GGCGGCGGTC GAAGTTTTTA G 

EF080-2 (SEQ ID NO:302) 

MKKR LLPIFFLILL TFGLALPVSA AENSIDDGAQ ^„r>^„nTTr.T 
LLTPDQINQL KQEIQPLEEK TKASVFIVTT NNNTYGDEQE YADHYLLNKV GKDQNAILFL 
IDMDLRKIYI STSGNMIDYM TDARIDDTLD KIWDNMSQGN YFAAAQTFVQ ETQAFVNKGV 
PGGHYRVDSE TGKITRYKVI TPLEMVIAFA AALILSLVFL GINISKYQLK FSSYQYPFRE 
KTTLNLTSRT DQLTNSFITT RRIPKNNGGS GGMGGGGSTT HSTGGGTFGG GGRSF 

EF080-3 (SEQ ID NO:303) 
GGCTGAAA ATTCAATTGA TGATGGCGCA 

CAATTACTGA CACCTCATCA AATCAACCAA CTAAAGCAAG AGATACAACC TTTAGAAGAA 
AAAACAAAAG CCTCTGTCTT TATTCTAACC ACAAATAATA ATACCTATCG CGATCAACAA 
GAATATCCAG ATCATTATCT TTTAAATAAA GTTCGCAAGG ACCAAAATCC GATTCTTTTT 
CTCATTCATA TGGACTTACG GAAAATCTAC ATCTCTACTT CTGGAAACAT GATTGATTAT 
ATCACAGATC CACGAATTCA TCATACCTTA GATAAAATAT GGGATAATAT GAGTCAAGGA 
AATTATTTCG CGGCTCCTCA AACCTTTCTT CAGGAAACTC AAGCATTTGT TAATAAAGGG 
GTTCCTGGGG GGCACTATCG TCTCGACAGC GAAACAGGTA AAATCACTCG TTATAAAGTC 
ATTACCCCGC TCGAAATGGT AATIGCTTTr GCTGCTCCGC TCATACTCAG TTTCGTCTTC 
TTAGGCATTA ATATTTCTAA ATATCAATTA AAATTTTCAA GTTATCAATA TCCCTTTAGG 
GAAAAAACAA CTTTAAACTT AACCTCCCGC ACAGATCAGT TAACCAACTC TTPCATCACT 
ACGCGTCGTA TTCCTAAAAA CAATCGCGGC AGTGGCGGAA TGGGCGGTCG TGGTAGCACC 
ACCCACTCAA CTGGCGGCGG CACATTCGGT GGCGGCGGTC GAAGT 

EF080-4 (SEQ ID NO:304) 



LLTPDQINQL KQEIQPLEEK TKASVFIVTT NNNTYGDEQE YADHYLLNKV GKDQNAILFL 
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IDMDLRKIYI STSGNMIDYM TDARIDDTLD KIWDNMSQGN YFAAAQTFVQ ETQAFVNKGV 
PGGHYRVDSE TGKITRYKVI TPLEMVIAFA AALILSLVFL GINISKYQLK FSSYQYPFRE 
KTTLNLTSRT DQLTNSFITT RRIPKNNGGS GGMGGGGSTT HSTGGGTFGG GGRS 

EF081-1 {SEQ ID NO:305) 

TGAATGGAAC GAAGCAATCG TAATAAAAAA TCTTCAAAAA AACCACTTAT TCTTGGTGTT 
TCTGCCTTGG TTCTAATCGC TGCTGCCGGT GGCGGGTATT ATGCTTATAG TCAATGGCAA 
GCCAAACAAG AATTAGCCGA AGCGAAGAAA ACAGCTACTA CATTTTTAAA CGTATTGTCA 
AAACAGGAAT TTGATAAGTT ACCGTCCGTT GTTCAAGAAG CTAGCTTAAA GAAAAATGGC 
TATGATACTA AATCTGTTGT TGAAAAATAC CAAGCAATTT ATTCAGGGAT TCAAGCAGAA 
GGAGTCAAAG CTAGTGATGT TCAAGTCAAA AAGGCGAAAG ACAATCAATA CACATTTACC 
TATAAATTAT CGATGAGCAC GCCTTTAGGC GAAATGAAAG ATTTGTCTTA TCAA TCAAGT 
ATCGCCAAAA AAGGCGATAC CTACCAAATC GCTTGGAAGC CATCTTTAAT TTTTCCAGAT 
ATGTCAGGAA ATGATAAAAT TTCGATTCAA GTAGATAATG CCAAACGTGG AGAAATTGTC 
GATCGTAATG GTAGTGGGCT AGCAATTAAC AAAGTGTTTC ACGAAGTGGG CGTAGTGCCT 
GGCAAACTCG GTTCTGGCGC AGAAAAAACA GCCAATATCA AAGCTTTTAG TGATAAATTC 
GGCGTTTCTG TTGATGAAAT CAATCAAAAG TTAAGCCAAG GATGGGTCCA AGCAGACTCC 
TTTGTACCAA TCACAGTCGC TTCTGAACCA GTGACAGAAT TACCAACAGG GGCTGCGACA 
AAAGATACAG AGTCACGTTA TTATCCGCTG -GGGGAAGCAN TGCGCAATTA A 

EF081-2 (SEQ ID NO:306) 

MERSNRNKKS SKKPLILGVS ALVLIAAAGG GYYAYSQWQA KQELAEAKKT ATTFLNVLSK 
QEFDKLPSW QEASLKKNGY DTKSWEKYQ AIYSGIQAEG VKASDVQVKK AKDNQYTFTY 
KLSMSTPLGE MKDLSYQSSI AKKGDTYQIA WKPSLIFPDM SGNDKISIQV DNAKRGEIVD 
RNGSGLAINK VFDEVGWPG KLGSGAEKTA NIKAFSDKFG VSVDEINQKL SQGWVQADSF 
VPITVASEPV TELPTGAATK DTESRYYPLG EAXRN 

EF081-3 (SEQ ID NO: 307) 

T GGCGGGTATT ATGCTTATAG TCAATGGCAA 

GCCAAACAAG AATTAGCCGA AGCGAAGAAA ACAGCTACTA CATTTTTAAA CGTATTGTCA 
AAACAGGAAT TTGATAAGTT ACCGTCCGTT GTTCAAGAAG CTAGCTTAAA GAAAAATGGC 
TATGATACTA AATCTGTTGT TGAAAAATAC CAAGCAATTT ATTCAGGGAT TCAAGCAGAA 
GGAGTCAAAG CTAGTGATGT TCAAGTCAAA AAGGCGAAAG ACAATCAATA CACATTTACC 
TATAAATTAT CGATGAGCAC GCCTTTAGGC GAAATGAAAG ATTTGTCTTA TCAATCAAGT 
ATCGCCAAAA AAGGCGATAC CTACCAAATC GCTTGGAAGC CATCTTTAAT TTTTCCAGAT 
ATGTCAGGAA ATGATAAAAT TTCGATTCAA GTAGATAATG CCAAACGTGG AGAAATTGTC 
GATCGTAATG GTAGTGGGCT AGCAATTAAC AAAGTGTTTG ACGAAGTGGG CGTAGTGCCT 
GGCAAACTCG GTTCTGGCGC AGAAAAAACA GCCAATATCA AAGCTTTTAG TGATAAATTC 
GGCGTTTCTG TTGATGAAAT CAATCAAAAG TTAAGCCAAG GATGGGTCCA AGCAGACTCC 
TTTGTACCAA TCACAGTCGC TTCTGAACCA GTGACAGAAT TACCAACAGG <3GCTGCGACA 
AAAGATACAG AGTCACGTTA TTATCCGCTG GGGG 

EF081-4 (SEQ ID NO:308) 

G GYYAYSQWQA KQELAEAKKT ATTFLNVLSK 

QEFDKLPSW QEASLKKNGY DTKSWEKYQ AIYSGIQAEG VKASDVQVKK AKDNQYTFTY 
KLSMSTPLGE MKDLSYQSSI AKKGDTYQIA WKPSLIFPDM SGNDKISIQV DNAKRGEIVD 
RNGSGLAINK VFDEVGWPG KLGSGAEKTA NIKAFSDKFG VSVDEINQKL SQGWVQADSF 
VPITVASEPV TELPTGAATK DTESRYYPLG 
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EF082-1 (SEQ ID NO:309) 

TAAAAAATGA AAAAGATCGT GCGCATTTCA AGCATTTTGT TCGTTGCTAC GCCTCTTATG 
CTTTTAAATA GTTCAAAAGT TGAAGCAGCT CAAGTCGCTT CTATTCAATC CAACGCTGAT 
ATTACGTTTG GTCTTGATAA TACTGTCACG CCACCTGTCA ACCCGACGAA CCCTTCTCAG 
CCTGTGACAC CTAATCCTGC TGATCCTCAT CAACCTGGTA CAGCCGGACC CCTTAGTATT 
GACTATGTTT CAAATATCCA TTTTGGATCA AAACAAATTC AAGCCGGAAC AGCGATCTAT 
TCGGCACAAC TGGATCAAGT GCAAAATAGT ACTGGCGATT TAATTAGCGT GCCAAACTAT 
GTTCAAGTAA CTGACAAACG TGGTCTAAAT CTTGGCTGGA AATTATCAGT TAAACAGAGT 
GCGCAATTTG CTACAAGTGA TTCAACACCC GCTGTTTTGG ATAATGCATC CTTGACCTTT 
TTAGCAGCAA CACCCAATTC AACACAGTTA CTTTCTTTGG CGCCATTAAC GGTCCCAGTA 
ACCTTGGATC CAACTGGTGC CGCCACTTCT CCTGTGGCGA CTGCCGCTCT TTCAACAGGA 
ATGGGCACTT GGACATTAGC TTTTGGTAGC GGANCGACCG CTGCTCAAGG CATTCAATTA 
ACTGTTCCTG CGACAACGAA AAAAGTTGCA GCTAAACAAT ATAAAACAAC GCTTACTTGG 
ATTTTGGATG ATACACCACT TTAA 

EF082-2 (SEQ ID NO:310) 

MKKIVRISS ILFVATPLML LNSSKVEAAQ VASIQSNADI TFALDNTVTP PVNPTNPSQP 
VTPNPADPHQ PGTAGPLSID YVSNIHFGSK QIQAGTAIYS AQLDQVQNST GDLISVPNYV 
QVTDKRGLNL GWKLSVKQSA QFATSDSTPA VLDNASLTFL AATPNSTQLL SLAPLTVPVT 
LDPTGAATSP VATAALSTGM GTWTLAFGSG XTAAQGIQLT VPATTKKVAA KQYKTTLTWI 
LDDTPL 

EF082-3 (SEQ ID N0:311) 

AGCT CAAGTCGCTT CTATTCAATC CAACGCTGAT 

ATTACGTTTG CTCTTGATAA TACTGTCACG CCACCTGTCA ACCCGACGAA CCCTTCTCAG 
CCTGTGACAC CTAATCCTGC TGATCCTCAT CAACCTGGTA CAGCCGGACC CCTTAGTATT 
GACTATGTTT CAAATATCCA TTTTGGATCA AAACAAATTC AAGCCGGAAC AGCGATCTAT 
TCGGCACAAC TGGATCAAGT GCAAAATAGT ACTGGCGATT TAATTAGCGT GCCAAACTAT 
GTTCAAGTAA CTGACAAACG TGGTCTAAAT CTTGGCTGGA AATTATCAGT TAAACAGAGT 
GCGCAATTTG CTACAAGTGA TTCAACACCC GCTGTTTTGG ATAATGCATC CTTGACCTTT 
TTAGCAGCAA CACCCAATTC AACACAGTTA CTTTCTTTGG CGCCATTAAC GGTCCCAGTA 
ACCTTGGATC CAACTGGTGC CGCCACTTCT CCTGTGGCGA CTGCCGCTCT TTCAACAGGA 
ATCGGCACTT GGACATTAGC TTTTGGTAGC GGANCGACCG CTGCTCAAGG CATTCAATTA 
ACTGTTCCTG CGACAACGAA AAAAGTTGCA GCTAAACAAT ATAAAACAAC GCTTACTTGG 
ATTTTGGATG ATACACCACT 

EF082-4 (SEQ ID NO:312) 

AQ VASIQSNADI TFALDNTVTP PVNPTNPSQP 

VTPNPADPHQ PGTAGPLSID YVSNIHFGSK QIQAGTAIYS AQLDQVQNST GDLISVPNYV 
QVTDKRGLNL GWKLSVKQSA QFATSDSTPA VLDNASLTFL AATPNSTQLL SLAPLTVPVT 
LDPTGAATSP VATAALSTGM GTWTLAFGSG XTAAQGIQLT VPATTKKVAA KQYKTTLTWI 
LDDTP 

EF083-1 (SEQ ID NO:313) 

TAATTTAAAA GACAAGGAGA AATAAAAATG AAAAAGAAAA TTTTAGCAGG AGCGCTTGTC 
GCTCTGTTTT TTATGCCTAC AGCTATGTTT GCCGCAAAAG GAGACCAAGG TGTGGATTGG 
GCGATTTATC AAGGTGAACA AGGTCGCTTT GGCTATGCAC ATGATAAATT GGCTATTGCC 
CAGATTGGAG GCTACAATGC TAGCGGTATT TATGAACAAT ACACATATAA AACGCAAGTG 
GCAAGTGCTA TTGCCCAAGG TAAACGTGCG CATACCTATA TTTGGTATGA CACTTGGGGA 
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AACATCGACA TTCCGAAAAC AACAATGGAT TACTTTTTGC CACGTATTCA AACGCCTAAA 
^tJStcG ?ScATTAGA TTTTOAACAT GGAGCGTIX^G CTAGTGTirC AGATGGATAT 
S^ISatS TAAGTTCAGA TCCCGA^ GCAGCAAATA CAGAGACAAT OTTCTACGGT 
S^?gSgS tcaaIcaggc TCGCTATACT CCAATCTATT AOAGCTATAA gccatttaca 
JSSSaJ^ S^tatca acaaatca-tc aaagagtttc ctaactcttt atggattgct 

GC^S TCGA-K3GTGT GTCACCATAT CCATTGTATC CTTATTTCCC AAGCATGGAT 
SISa TTTGGCAArr CACATCCGCT TATATIX5CAG GTGGTTTAGA TGGTAACGTA 

SSSg gaattacgga TAGTGGTTAT acagatacca ataaaccaga aacggatacg 

SSSg aJSaGGCGA AGAAATTGAA AAAATACCTA ATTCl^TGT TAAAGTTGGC 
GATAC^TCA AAGTGAAATT TAATGTAGAT GCTTCGGCAA CTGGGGAAGC TATTCCGCAA 
SSSg GAAACAGCTA CAAAGTCCAA GAAGTAACTC GAAGCAGAGT AITCCTTGAA 
C^ATTAG CAAAGGT^AT AT1«AATTAT a<3CCAGACGC AACAGTCGTC 

??tgItaagc aaccagaagc gactcatctg gtacaatacg gagaaacatt atcaagtatt 

GCTTATCAAT ATGGAACAGA CTATCAAACG TTCGCGGCAT TAAATGGATT GGCTAATCCA 
AATCTTATTT ATCCTGGTCA AGnTTCAAA GTCAATGGAT CGGCAACAAG TAATCTCTAC 
ACGG-TTAAAT ACGGCGATAA TTTATCTAGT ATTGCAGCAA AACTTGGCAC TACTTATCAA 
GCTTTAGCTG CATTAAACGG ATTAGCAAAT CCTAACTTGA ITTATCCAGG TCAAACATTG 
AATTATTAA 

EF083-2 (SEQ ID NO:314) 

MK KKILAGALVA LFFMPTAMFA AKGDQGVDWA lYQGEQGRFG YAHDKFAIAQ 
^SSS eqytySqva SAIAQGKRAH TYIWYDTWGN MDIAKTIMDY FLPRIQTPKN 

sSSfehg Ssvpdgyg gyvssdaeka antetilygm rrikqagytp myysykpftl 
^^qSS efpnslwiaa ypidgvspyp lyayfpsmdg igiwqftsay iaggldgnvd 

LTGITOSGYT IVrNKPETDTP ATDAGEEIEK IPNSDVKVGD TVKVKFNVDA WATCEAIPQW 

vkgnsykvqe vtcsrvlleg ilswiskgdi ellpdatwp dkqpeathw qygetlssia 
^^?gSl ISnglanpn liypgqvlkv ngsatsnvyt vkygdnlssi aaklgttyqa 

laalnglanp nliypgqtln y 

ef083-3 (seq id no:315) 

AAAAG GAGACCAAGG TGTGGATTGG 

^?T?ATC AAGGTCAACA AGGTCGCTTT GGCTATCCAC ATGATAAATT CGCTA1TGCC 
CAGaSSaG GCTACAATCC TAGCGGTATT TA1X5AACAAT ACACATATAA AACGCAAGTG 
GCAAGTCCTA TTCCCCAAGG TAAACGTCCG CATACCTATA TTTGGTATGA CACTTGGGGA 
SSSaCA SgCGAAAAC AACAATCGAT TACTTTTTCC CACGTATTCA AACGCCTAAA 
^SStcG TTGCATTAGA TTTTGAACAT GGAGCGTTX3G CTAGTGTTCC AGATGGATAT 
GGAGGATATG TAAGTTCAGA TCCCGAAAAA GCAGCAAATA CAGAGACAAT TTTGTACGGT 

a^^SSgS tcaaacaggc tcgctatact ccaatotatt acagctataa gccatttaca 

CTAAATCATG TAAACTATCA ACAAATCATC AAAGAGTTTC CTAACTCTTT ATGGATTGCT 
GC^TCcS JSaTGGTGT GTCACCATAT CCATTCTAT^ CTTATTTCCC AAGCATGGAT 
GgSgTA TTTGGCAA-IT CACATCCGCT TATATTCCAG GT^SGTiTAGA I^TAACGTA 
GATTTAACAG GAATTACGGA TAG-TCGTTAT ACAGATACCA ATAAACCAGA AACGGATACG 
CCAGCAACAG ATCCAGGCGA AGAAATTCAA AAAATACCTA ATTCTGATGT TAAAGTTGGC 
GATACCGTCA AAGTGAAATT TAATCTAGAT GCTTGGGCAA CTGGGGAAGC TATTCCGCAA 
TCGGTAAAAG GAAACAGCTA CAAAGTCCAA GAAGTAACTG GAAGCAGAGT ATTGCTTGAA 
GGTATCTO3T CATCGATTAG CAAAGG1X3AT ATTGAATTAT TGCCAGACGC AACAGTCGTC 
CCTGATAAGC AACCAGAAGC GACTCATGTC GTACAATACG GAGAAACATT ATCAAGTATT 
GCTTATCAAT ATGGAACAGA CTATCAAACG TTGGCGGCAT TAAATGGATT GGCTAATCCA 
AATCTTATTT ATCCTCGTCA AGTTrTOAAA GTCAATGGAT CGGCAACAAG TAATGTCTAC 
ACGGTTAAAT ACGGCGATAA TTTATCTAGT ATTGCAGCAA AACTTGGCAC TACTTATCAA 
GCTTTAGCTG CATTAAACGG ATTAGCAAAT CCTAACTTGA TTTATCCAGG TCAAACATTG 
AAT 
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ef083-4 (seq id no: 316) 

s??IlSeS aSdgyg gyvssdaeka antetilygm rrikqagytp myysykpftl 
^ooTTK eSnslwiaa ypidgvspyp lyayfpsmdg igiwqftsay iaggldgnvd 
SdsgS SpSJS I^ageeiek ipnsdvkvgd wkvkfnvda watgeaipqw 

SSSqe SJgsrvlleg ILSWISKGDI ELLPDATWP DKQPEATHW QVCETL^IA 
^SSqS aalnglanpn liypgqvlkv. NGSATSNVYT VKYGDNLSSI AAKLGTTYQA 
LAAIiNGLANP NLIYPGQTLN 

EF084-1 (SEQ ID NO:317) 

TAGTCAAACG TTTATTTrTT CCTTAAATCC AGAAAAAATC CCGTAATTAT GGTACACTAC 
cSS^^S SIggagaac TATGAAGAAA TTTCATGTAA TTATTCTCGG ^^ctgggacg 
aSgSmga tcgccacgat tgcggccgcc gaagcaggcg ctcaagtatt attgattgaa 

SSSS ^JSgGAA AAAATTATTA Al^SAC^TC GCGGCCGCTG TAATGTAACC 
^TCGGC CCGCAGAAGA AATCATTTCA TTTATTCCTC GGAATGGAAA A™ATAC 
SSSt CACAATTTCA TAACTATCAT ATCATGAACT TTrrTCAATC CAATGGTATT 

caSSg aagaagatca cggacgcatg ttccctctta cagataaatc gaagtcaatt 

gS^S^? TATTTAACCG CAITAACGAA TTAGGAGTCA CTGTTrrrAC aaaaacacag 
G^ScSSt TACTACGAAA AGACGATCAA ATAATltSGCG TTGAAACCGA ACTGGAAAAA 
aSa^^C CGTGTGTTGT ATTAACAACT GGCGGCCGCA crrATCCTTC CACAGGAGCA 

A^TCATC gcStaaact agccaaaaaa atcgggcata ccatcagccc gctctaccct 

AcSSI? cSS^ TGAAGAACCT TTTATCCrGG ATAAAACGTT GCAAGGTCTC 
tcS^SJg aSSStTT AAC-rcTTTTG AACCAAAAAG GAAAACCTTT AGTTAATCAT 
SSSSa StTTAC ACATITTXSGC ATTTCAGGAC CTGCCGCGCT CCGCTGTTCT 
aS^SStTA ACCAAGAATT AACTCGCAAC GGTAATCAAC CTGTCACGGT AGCCrrGGAT 

cSSa ^Saatco^ ox^aagaagtg cco^ccaaac aactaacaga ^caacgn 

CTTTCCTTTG TGGAACTACT GAAAGACTTT CAGTTCACTG TTACGAAAAC ATTGCCTTTG 
PAAAAATCTT TTGTCACAGG CGGTCGGATT TCCCTCAAAG AAGTGACCCC TAAAACAATG 
G^^IS? JStcS^GG TTTArrXTTT GC1«G1X5AAC TTTTAGATAT TAATGGCTAT 
AcSgA^? ACAATGTTAC AGCTCCATTT GTCACTGGAC ATGIT^CTCG CTCCCATGCC 
GCAGAAATTC CAGAATACAC CTATTTACCA ATTGAAGAAG TCTAA 

EF084-2 (SEQ ID NO:318) 

S= JSS 

lodSltvln qkgkplvnhq mdmlfthfgi sgpaalrcss finqeltrng nqpvtvaldv 

FpS^ SqSeZl SPVELLKDFQ FTV^TLPLE KSFVTGGGIS LJEVrPKIME 
SKLVNGLFFA GELLDINGYT GGYNVTAAFV TGHVAGSHAA eiaeytylpi eev 

EF084-3 (SEQ ID NO:319) 

C GAAGCAGGCG CTCAAGTATT ATTGATTGAA 

L^tcS GTGTTGGGAA AAAATTATTA ATCACmGTG GCGGCCGCTG TAATGTAACC 
J^tcS CCGCAGAAGA AATCATTTCA TTTATTCCTG GGAATGGAAA A^ATAC 

^gcgcatSt cacaatttca taactatgat atcatcaact tttitgaatc caatggtatt 
ScSJIIS SgJ^^a cggacgcato ttccctgtta cagataaatc gaagtcaatt 

f^^OZ JaTTTAACCG CATTAACGAA TTAGGAGTCA CaXSTrTTTAC AAAAACACAG 
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GTCACAAAAT TACTACGAAA AGACGATCAA ATAATTGGCG TTGAAACCGA ACTGGAAAAA 
ATTTATGCAC CGTGTCTTGT ATTAACAACT GGCGGCCGCA CTTATCCTTC CACAGGAGCA 
ACTGGTGATG GCTATAAACT AGCCAAAAAA ATGGGGCATA CCATCAGCCC GCTCTACCCT 
ACCGAATCAC CTATTATTTC TGAAGAACCT TTTATCCTGG ATAAAACGTT GCAAGGTCTC 
TCTTTACAAG ATGTTAATTT AACTGTTTTG AACCAAAAAG GAAAACCTTT AGTTAATCAT 
CAAATGGATA TGCTGTTTAC ACATTTTGGC ATTTCAGGAC CTGCCGCGCT CCGCTGTTCT 
AGTTTTATTA ACCAAGAATT AACTCGCAAC GGTAATCAAC CTGTCACGGT AGCCTTGGAT 
GTGTTTCCGA CAAAATCTTT TGAAGAAGTG CCTGCCAAAC AACTAACAGA AAAGCAACGN 
CTTTCCTTTG TGGAACTACT GAAAGACTTT CAGTTCACTG TTACGAAAAC ATTGCCTTTG 
GAAAAATCTT TTGTCACAGG CGGTGGGATT TCCCTCAAAG AAGTGACCCC TAAAACAATG 
GAGAGCAAAT TAGTCAATGG TTTATTTTTT GCTGGTGAAC TTTTAGATAT TAATGGCTAT 
ACTGGAGGCT ACAATGTTAC AGCTGCATTT GTCACTGGAC ATGTTGCTGG CTCCCATGCC 
GCAGAAATTG CAGAATACAC CTATTTACCA ATTGAAGAAG TC 

EF084-4 (SEQ ID NO:320) 

E AGAQVLLIEK 

NRRVGKKLLM TGGGRCNVTN NRPAEEIISF IPGNGKFLYS AFSQFDNYDI MNFFESNGIH 
LKEEDHGRMF PVTDKSKSIV DALFNRINEL GVTVFTKTQV TKLLRKDDQI IGVETELEKI 
YAPCWLTTG GRTYPSTGAT GDGYKLAKKM GHTISPLYPT ESPIISEEPF ILDKTLQGLS 
LQDVNLTVLN QKGKPLVNHQ MDMLFTHFGI SGPAALRCSS FINQELTRNG NQPVTVALDV 
FPTKSFEEVP- AKQLTEKQRL SFVELLKDFQ FTVTKTLPLE KSFVTGGGIS LKEVTPKTME 
SKLVNGLFFA GELLDINGYT GGYNVTAAFV TGHVAGSHAA EIAEYTYLPI EEV 

EF085-1 (SEQ ID NO:321) 

TAACCCATGA AATCATTTTG TCCCGCATAT GGGGATATGA CTTTGACGGT GATGGCAGCA 
CAGTCCACAC TCATATCAAA AATCTGCGGG CGAACTGCCG GAAAATATCA TCAAAACCAT 
CCGGGGTGTA GGTTACCGAT TGGAGGAATC ATTATAATGG AAAGAAAAGG GATTTTCATT 
AAGGTTTTTT CCTATACGAT CATTGTCCTG TTACTGCTTG TCGGTGTAAC GGCAACACTG 
TTTGCACAGC AATTTGTGTC TTATTTCAGA GCGATGGAAG CACAGCAAAC AGTAAAATCC 
TATCAGCCAT TGGTGGAACT GATTCAGAAT AGCGATAGGC TTGATATGCA AGAGGTGGCA 
GGGCTGTTTC ACTACAATAA CCAATCCTTT GAGTTTTATA TTGAAGATAA AGAGGGAAGC 
GTACTCTATG CCACACCGAA TGCCGATACA TCAAATAGTG TTAGGCCCGA CTTTCTTTAT 
GTGGTACATA GAGATGATAA TATTTCGATT GTTGCTCAAA GCAAGGCAGG TGTGGGATTG 
CTTTATCAAG GGCTGACAAT TCGGGGAATT GTTATGATTG CGATAATGGT TGTATTCAGC 
CTTTTATGCG CGTATATCTT TGCGCGGCAA ATGACAACGC CGATCAAAGC CTTAGCGGAC 
AGTGCGAATA AAATGGCAAA CCTGAAAGAA GTACCGCCGC CGCTGGAGCG AAAGGATGAG 
CTTGGCGCAC TGGCTCACGA CATGCATTCC ATGTATATCA GGCTGAAAGA AACCATCGCA 
AGGCTGGAGG ATCAAATCGC AAGGGAACAT GAGTTGGAGG AAACACAGCG ATATTTCTTT 
GCGGCAGCCT CTCATGAGTT AAAAACGCCC ATCGCGGCTG TAAGCGTTCT GTTGGAGGGA 
ATGCTTCAAA ATATCGGTGA CTACAAAGAC CATTCTAAGT ATCTGCGCGA ATGCATCAAA 
ATGATGGACA GGCAGGGCAA AACCATTTCC GAAATACTGG AGCTTGTCAG CCTGAACGAT 
GGGAGAATCG TACCCATAGC CGAACCGCTG GACATAGGGC GCACGGTTGC CGAGCTGCTA 
CCCGATTTTC AAACCTTGGC AGAGGCAAAC AACCAGCGGT TCGTCACAGA TATTCCAGCC 
GGACAAATTG TCCTGTCCGA TCCGAAGCTG ATCCAAAAGG CGCTATCCAA TGTCATATTG 
AATGCGGTTC AGAACACGCC CCAGGGAGGT GAGGTACGGA TATGGAGTGA GCCTGGGGCT 
GAAAAATACC GTCTTTCCGT TTTGAACATG GGCGTTCACA TTGATGATAC TGCACTTTCA 
AAGCTGTTCA TCCCATTCTA TCGCATTGAT CAGGCGCGAA GCAGCAAAAA GTGGGCGAAG 
CGGTTTGGGG CTTGCCATCG TACAAAAAAC GCTGGATGCC ATGAGCCTCC AATATGCGCT 
GGAAAACACC TCAGATGGCG TTTTGTTCTG GCTGGATTTA CCGCCCACAT CAACACTATA 
AATATTTAA 
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EF085-2 (SEQ ID NO:322) 

wS™L LLVGVTATLF AQQFVSYFRA MEAQQTVKSY QPLVELIQNS DRLDMQEVAG 
LFHYNNOSFE FYIEDKEGSV LYATPNADTS NSVRPDFLYV VHRDDNISIV AQSKAGVGL.L 
ySISrgiv MIAIMWFSL LCAYIFARQM TTPIKALADS ANKMANLKEV PPPLERKDEL 

SdSS SSSiar ledeiarehe leetqryffa aashelktpi aavsvl^egm 

IfmOBYKm SKYLRECIKM MDRQGKTISE ILELVSLNDG RIVPIAEPLD XGR^VAELLP 

dfotiaeann qrfvtdipag qivlsdpkli qkalsnviln avqntpqgge vriwsepgae 
kyrlS^ ?S^alsk £fipfyridq arsskkwakr fgachrt^na gcheppicag 
khlrwrfvla gftahintin i 

ef085-3 (seq id no: 323) 

rr AATTTGTGTC TTATTTCAGA GCGATGGAAG CACAGCAAAC AGTAAAATCC 
TAT^^?^?SgSgScT GArrcAGAAT AGCGATAGGC TTGATATGCA AGAGGTGGCA 
S^CTCTTTc Sacaataa CCAATCCTTT GAGTTTTATA TTGAAGATAA AGAGGGAAGC 

SctcS^S SaJ^gaa tcccgataca tcaaatagtg ttaggcccga ctttctttat 
gJ^ScaS gagatgataa tatttcgatt gitcctcaaa gcaaggcagg tc-igggattc 
cSScJIg gScaat tcggggaatt gttatgattg cgataatggt tctattcagc 
cS^^G cgtSatctt tgcgcggcaa atcacaacgc cgatcaaagc cttagcggac 
aSSS Satggcaaa cctgaaagaa gtaccgccgc cgctggagcg aaaggatgag 
c?S?Sc JSScACGA catgcattcc atgtatatca ggctcaaaga aaccatcgca 
aSSgagg atgaaatcgc aagggaacat gagttcgagg aaacacagcg atatttctit 

gSgSgcS CTC^TGAGTr AAAAACGCCC ATCGCGGCTG TAAGCGTTCT GTTGGAGGGA 

aSS^SI SaJStga ctacaaagac cattctaagt atctccgcga atgcatcaaa 

iSA^iS GGCAGGGCAA AACCATTTCC GAAATAC1X3G AGCTTGTCAG CCTOAACGAT 

gS^Sg tacccatagc cgaaccgctc gacatagggc gcacggttgc cgagctgcta 
?c?GA?Sc Iaaccttcgc agaggcaaac aaccagcggt tcgtcacaga tattccagcc 
SacSS tcctgtccga tccgaagctc atccaaaagg cgctatccaa tgtcatattg 
Stc^Jtc agaacacgcc ccagggaggt gaggtacgga tatggagtga gcctggggct 

^^Zc GTCTTTCCGT TI^AACAI.. GGCGTIK^ACA TTG^TGATAC TGC^^^A 
AfiGCTGTTCA TCCCATTCTA TCGCATTCAT CAGGCGCGAA GCAGCAAAAA GTGGGCGAAG 
Sg^SS cScStcg TACAAAAAAC GCTCGATCCC ATGAGCCTCC AATATGCGCT 

Z^I^c t^Igaox^gcg tt^ttctg gco^at^a ccgcccacat caacactata 

AATATTT 

EF085-4 {SEQ ID NO:324) 

mn/ciYFRA MEAOOTVKSY QPLVELIQNS DRLDMQEVAG 

LSSFf^?™GSV LYATPNADTTS NSVRPDFLYV VHRDDNISIV AQSKAGVGLL 
YoSgIV MIAiSa^FSL LCAYIFARQM TTPIKALADS ANKMANLKEV PPPLERKDEL 
Sd^SM ^^^SiAR LEDEIAREHE LEETQRYFFA AASHELKTPI AAVSVLL^M 
!SSSSdH SKYLRECIKM MDRQGKTISE ILELVSLNDG RIVPIAEPLD IGRTVAELLP 
DfSSS QR^^5pS QIv£sDPKLI QKALSNVILN AVQNTPQGGE VRIWSEPGAE 
KyS^ ?h^ALSK LFIPFYRIDQ arsskkwakr FGACHRTKNA GCHEPPICAG 
KHLRWRFVLA GFTAHINTIN I 

EF086-1 (SEQ ID NO:325) 

TAACTGGTCG GATTCGCAAA TTGGTrCCGC GCAGCGCTAA CAGATACATT GA-mTATTA 

cJ?SItcaS ^attgaatac agatccagaa aaattaaata aatttactgc tccgctgatg 
c^SaScS Sgatccaaa catacaaox^g ccaatttai^ l^^l^ 
acagatattt caatcaccgt tttaggtact ggacttttgt tagaagataa tcaacgccta 
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GTACAAGTAC AAGAAGCTGT TCCGTCCGTT TTAAAAAGTG TTTCCTCTGG TGATGGCTTA 
TATCCTGATG GTTCCTTGAT TCAACATGGT TATTTTCCGT ACAACGGCAG TTACGGGAAT 
GAGTTGCTAA AAGGGTTTGG ACGAATTCAG ACTATTTTAC AAGGTTCCGA CTGGGAGATG 
AATGACCCTA ACATTAGTAA TTTATTTAAT GTTGTGGATA AAGGTTACTT ACAATTGATG 
GTAAATGGAA AAATGCCATC GATGGTTTCT GGTAGAAGTA TTTCCAGAGC GCCAGAAACG 
AATCCTTTTA CTACAGAGTT TGAATCGGGT AAAGAAAC7U\ TAGCTAATTT AACCTTAATT 
GCAAAATTTG CACCAGAAAA TTTAAGAAAT GACATTTATA CATCTATCCA AACGTGGCTT 
CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGATTTTGA AGCGTTAATT 
GACTTGAAAA ATGTAGTGAA TAGTGCGTCA CCTGCCCAAG CGACACCAAT GCAATCTTTA 
AATGTATATG GTTCGATGGA TCGAGTCCTA CAGAAAAATA ACGAATATGC GGTGGGGATC 
AGTATGTATT CACAACGTGT CGGAAACTAT GAATTTGGGA ATACGGAAAA TAAAAAAGGC 
TGGCATACAG CAGACGGCAT GCTTTATTTA TACAATCAAG ACTTTGCTCA GTTTGATGAA 
GGATACTGGG CAACGATCGA TCCATATCGA TTACCAGGAA CGACAGTTGA CACAAGAGAA 
TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT CATGGGTAGG TGGCTCAAAT 
AATGGACAGG TTGGCTCTAT AGGAATGTTT TTAGATAAAA GTAATGAAGG AATGAACTTA 
GTTGCTAAAA AATCTTGGTT CTTATTAGAT GGTCAAATCA TTAATTTGGG AAGTGGCATT 
ACTGGTACGA CAGATGCTTC GATTGAAACA ATCCTCGATA ATCGGATGAT TCATCCACAG 
GAAGTGAAGC TTAACCAAGG TTCAGACAAA GATAATTCTT GGATTAGTTT AAGCGCAGCG 
ANTCCATTGA ATAACATTGG CTATGTTTTT CCTAATTCNA TGAATACGCT TGATGTTCAA 
ATAGAAGAAC GCTCTGGTCG CTACGGAGAT ATTAACGAAT ACTTTGTTAA TGATAAAACC 
TATACAAATA CATTTGCTAA AATTAGTAAA AATTATGGCA AGACTGTTGA AAATGGTACT 
TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCGCAGCTCT TTCTAAAAAC 
AAAGGCTATA CTCTTCTAGA AAATACAGCA AACTTACAAG CCATTGAAGC AGGTAATTAT 
GTCATGATGA ATACATGGAA TAATGACCAA GAAATTGCAG GACTGTATGC GTATGATCCA 
ATCTCGGTTA TTTCAGAAAA AATTGATAAC GGTGTTTATC GCTTAACTCT TGCGAATCCT 
TTACAAAATA ATGCATCCGT TTCTATTGAA TTTGATAAGG GCATTCTTGA AGTAGTCGCA 
GCGGACCCAG AAATTTCTGT TGACCAAAAT ATTAOXrACTT TAAATAGTGC GGGGTTAAAT 
GGCAGCTCGC GTTCAATCAT TGTTAAAACA ACTCCTGAAG TAACGAAAGA AGCGTTAGAA 
AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT ACACCGCAAG CAGCTGGAAA 
GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG ATCAAACAAC AGCAACGCAA 
GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT GGTAAAAGTG 
CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA CGAGAAACAC 
CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCATT GAAGCAAGCG 
CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAAGC AGAAGCAAAA 
CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GGGAAAATAA AAAGGAGCAA 
AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA AACTGGTACG 
AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG CTAGCCAATT TTTACCGAGC 
ACAGGAGAAA AGAAATCGAT CGCGCTTGTG ATTATTGGTC TTCTAGTTAT CGCCAGTGGG 
TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA 

EF086-2 (SEQ ID NO:326) 

LVGLANWFRA ALTDTLILLH DDLLNTDAEK LNKFTAPLML YAKDPNIQWP lYRATGANLT 
DISITVLGTG LLLEDNQRLV QVQEAVPSVL KSVSSGDGLY PDGSLIQHGY FPYNGSYGNE 
LLKGFGRIQT ILQGSDWEMN DPNISNLFNV VDKGYLQLMV NGKMPSMVSG RSISRAPETN 
PFTTEFESGK ETIANLTLIA KFAPENLRND lYTSIQTWLQ QSGSYYHFFK KPRDFEALID 
LKNWNSASP AQATPMQSLN VYGSMDRVLQ KNNEYAVGIS MYSQRVGNYE FGNTENKKGW 
HTADGMLYLY NQDFAQFDEG YWATIDPYRL PGTTVDTREL ANGAYTGKRS PQSWVGGSNN 
GQVASIGMFL DKSNEGMNLV AKKSWFLLDG QIINLGSGIT GTTDASIETI LDNRMIHPQE 
VKLNQGSDKD NSWISLSAAX PLNNIGYVFP NSMNTLDVQI EERSGRYGDI NEYFVNDKTY 
TNTFAKISKN YGKTVENGTY EYLTWGKTN EEIAALSKNK GYTVLENTAN LQAIEAGNYV 
MMNTWNNDQE lAGLYAYDPM SVISEKIDNG VYRLTLANPL QNNASVSIEF DKGILEWAA 
DPEISVDQNI ITLNSAGLNG SSRSIIVKTT PEVTKEALEK LIQEQKEHQE KDYTASSWKV 
YSEALKQAQT VADQTTATQA EVDQAETELR SAVK^^LVKVP TKEVDKTNLL KIIKENEKHQ 
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.KBVT^SWK -^--0 

NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQtbl't>x 

LIjVFRKSKSK K 

EF086-3 ■ (SEQ ID NO:327) 

CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGATTTTGA J^^^^ 
I^^Zk ATGTAGTCAA TAG-reCGTCA CCTGCCCAAG CGACACCAAT GCAATCTTTA 
S^TATA^ g™a .™A CAGAAAAA^ ACGAA..JOC GG^^T^ 
AGTATGTATT CACAACGTGT CGGAAAUiAi ArTTTGCTCA GTTTCATGAA 



AAT 

EF086-4 (SEQ ID NO:328) 



=£=£E= = = = 

EF087-1 (SEQ ID NO:329) 

TAACTGG^G GArTGGCAAA ^rrCCGC GCAGCGCTAA CAGATACATT 
^^fTGACC TA.I.AATAC AGATCCAGAA AAATTAAA.A AATTTAC^C TCCGCTGA^ 
CTGTATGCAA AAGATCCAAA CATACAATGG CCAATTTATC JTG ^^^^cCTA 
ACAGATATTT CAATCACCGT TTTAGGTACT GGACTTTTGT tcATCGCTTA 
GTACAAGTAC AAGAAGCTGT TCCGTCCGTT ^TAAAAAGTG ^^^^ ^TAGGGGAAT 
TATCCTGATC GTTCCTTGAT TCAACATGGT TATTTTCCGT ^CAACGGCA 
GAcrrGCTAA AAGGGTTTGG ACGAATTCAG ACTATTTTAC AAOGTTCCGA 
AAICACCCTA ACATTAGTAA TTTAITTAAT ^T^^J^ ^^^SSc ScCAGAAACG 
GTAAATGGAA AAATGCCATC GATGGTTTCT ^GTAGAAGTA TTTCCAGA^ ;^cCTTAATT 
AATCCTTTTA CTACAGAGTT TCAATCGGGT AAAGAAACAA TAGCTAATTa 
GCAAAATTTG CACCAGAAAA TTTAAGAAAT GACATTTATA CA^TATCCA ^^G 
CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGATTTl'OA ^ 

s= = iii = i= = 
— i= iii i= ?s 

GGATACTGGG CAACGATCGA TCCATATCtA CATCGGTAGG TCGCTCAAAT 

TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT ^ATGW^iii^ 

"™ ISSS? S= S=S S^SS SSScS 

ssss? .t™ ™- ~- 

SttS?^ ™^ 

=i iii = s = 
= s= — 

=s s= i^^s^ =i ~ 

TTACAAAATA ATCCATCCGT TTCTATTGAA TTTGATAAGG GCATTCTTGA 
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GCGGACCCAG AAATTTCTCT TCACCAAAAT ATTATCACTT TAAATAGTCC GGGGTTAAAT 
J^^^rTrGC GTTCAATCAT TGTTAAAACA ACTCCTCAAG TAACGAAAGA AGCGTTAGAA 
SSSc a^aJ^ISI AGAACACCAA GAAAAAGACT ACACCGCAAG cagct^aaa 

mmmmmm 

rJJcGTTCGG CAGTCAAGCG ATTAACATTC AAAAATAGTG GGGAAAATAA AAAGGAGCAA 

SS^^SS ™^ 3-^- -s^- 

^S^SS -™ ^'^^ ^^'^■^ 

TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA 



EF087-2 (SEQ ID NO:330) 

LVGLANWFRA ALTDTLILLH DDLLNTDAEK 
DISITVIiGTG LLLEDNQRLV QVQEAVPSVL 
LLKGFGRIQT ILQGSDWEMN DPNISNLFNV 
PFTTEFESGK ETIANLTLIA KFAPENLRND 
LKNWNSASP AQATPMQSLN VYGSMDRVLQ 
HTADGMLYLY NQDFAQFDEG YWATIDPYRL 
GQVASIGMFL DKSNEGMNLV AKKSWFLLDG 
VKLNQGSDKD NSWISLSAAX PLNNIGYVFP 
TNTFAKISKN YGKTVENGTY EYLTWGKTN 
MMNTWNNDQE lAGLYAYDPM SVISEKIDNG 
DPEISVDQNI ITLNSAGLNG SSRSIIVKTT 
YSEALKQAQT VADQTTATQA EVDQAETELR 
EKDYTASSWK VYSEALKQAQ TVADQTTATQ 
NGGNNGHLNT STGVDQTGTK QVKPSSQGGF 
LLVFRKSKSK K 

EF087-3 (SEQ ID NO:331) 



LNKFTAPLML YAKDPNIQWP lYRATGANLT 
KSVSSGDGLY PDGSLIQHGY FPYNGSYGNE 
VDKGYLQLMV NGKMPSMVSG RSISRAPETN 
lYTSIQTWLQ QSGSYYHFFK KPRDFEALID 
KNNEYAVGIS MYSQRVGNYE FGNTENKKGW 
PGTTVDTREL ANGAYTGKRS PQSWVGGSNN 
QIINIiGSGIT GTTDASIETI LDNRMIHPQE 
NSMNTLDVQI EERSGRYGDI NEYFVNDKTY 
EEIAALSKNK GYTVLENTAN LQAIEAGNYV 
VYRLTLANPL QNNASVSIEF DKGILEWAA 
PEVTKEALEK LIQEQKEHQE KDYTASSWKV 
SAVKQLVKVP TKEVDKTNLL KIIKENEKHQ 
AEVDQAEAKL RSAVKRLTLK NSGENKKEQK 
RKASQFLPST GEKKSIALVI IGLLVIASGC 



A ATCGGATGAT TCATCCACAG 
GAAGTGAAGC TTAACCAAGG TTCAGACAAA 
ANTCCATTGA ATAACATTGG CTATGTTTTT 
ATAGAAGAAC GCTCTGGTCG CTACGGAGAT 
TATACAAATA CATTTGCTAA AATTAGTAAA 
TACGAATATT TAACAGTGGT TGGGAAAACG 
AAAGGCTATA CTGTTCTAGA AAATACAGCA 
GTCATGATGA ATACATGGAA TAATGACCAA 
ATGTCGGTTA TTTCAGAAAA AATTGATAAC 
TTACAAAATA ATGCATCC 



GATAATTCTT GGATTAGTTT AAGCGCAGCG 
CCTAATTCNA TGAATACGCT TGATGTTCAA 
ATTAACGAAT ACTTTGTTAA TGATAAAACC 
AATTATGGCA AGACTGTTGA AAATGGTACT 
AATGAAGAAA TCGCAGCTCT TTCTAAAAAC 
AACTTACAAG CCATTCAAGC AGGTAATTAT 
GAAATTGCAG GACTGTATGC GTATGATCCA 
GGTGTTTATC GCTTAACTCT TGCGAATCCT 



EF087-4 (SEQ ID NO: 332) 



NRMIHPQE 

VKLNQGSDKD NSWISLSAAX 
TNTFAKISKN YGKTVENGTY 
MMNTWNNDQE lAGLYAYDPM 



PLNNIGYVFP NSMNTLDVQI 
EYLTWGKTN EEIAALSKNK 
SVISEKIDNG VYRLTLANPL 



EERSGRYGDI NEYFVNDKTY 
GYTVLENTAN LQAIEAGNYV 
QNNAS 



EF088-1 (SEQ ID NO:333) 
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TAACTGGTCG GATTGGCAAA TTGGTTCCGC GCAGCGCTAA CAGATACATT GATTTTATTA 
CATGATGACC TATTGAATAC AGATGCAGAA AAATTAAATA AATTTACTGC TCCGCTGATG 
CTGTATGCAA AAGATCCAAA CATACAATGG CCAATTTATC GTGCAACAGG AGCTAACTTA 
ACAGATATTT CAATCACCGT TTTAGGTACT GGACTTTTGT TAGAAGATAA TCAACGCCTA 
GTACAAGTAC AAGAAGCTGT TCCGTCCGTT TTAAAAAGTG TTTCCTCTGG TGATGGCTTA 
TATCCTGATG GTTCCTTGAT TCAACATGGT TATTTTCCGT ACAACGGCAG TTACGGGAAT 
GAGTTGCTAA AAGGGTTTGG ACGAATTCAG ACTATTTTAC AAGGTTCCGA CTGGGAGATG 
AATGACCCTA ACATTAGTAA TTTATTTAAT GTTGTGGATA AAGGTTACTT ACAATTGATG 
GTAAATGGAA AAATGCCATC GATGGTTTCT GGTAGAAGTA TTTCCAGAGC GCCAGAAACG 
AATCCTTTTA CTACAGAGTT TGAATCGGGT AAAGAAACAA TAGCTAATTT AACCTTAATT 
GCAAAATTTG CACCAGAAAA TTTAAGAAAT GACATTTATA CATCTATCCA AACGTGGCTT 
CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGATTTTGA AGCGTTAATT 
GACTTGAAAA ATGTAGTGAA TAGTGCGTCA CCTGCCCAAG CGACACCAAT GCAATCTTTA 
AATGTATATG GTTCGATGGA TCGAGTCCTA CAGAAAAATA ACGAATATGC GGTGGGGATC 
AGTATGTATT CACAACGTGT CGGAAACTAT GAATTTGGGA ATACGGAAAA TAAAAAAGGC 
TGGCATACAG CAGACGGCAT GCTTTATTTA TACAATCAAG ACTTTGCTCA GTTTGATGAA 
GGATACTGGG CAACGATCGA TCCATATCGA TTACCAGGAA CGACAGTTGA CACAAGAGAA 
TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT CATGGGTAGG TGGCTCAAAT 
AATGGACAGG TTGCCTCTAT AGGAATGTTT TTAGATAAAA GTAATGAAGG AATG/lACTTA 
GTTGCTAAAA AATCTTGGTT CTTATTAGAT GGTCAAATCA TTAATTTGGG AAGTGGCATT 
ACTGGTACGA CAGATGCTTC GATTGAAACA ATCCTCGATA ATCGGATGAT TCATCCACAG 
GAAGTGAAGC TTAACCAAGG TTCAGACAAA GATAATTCTT GGATTAGTTT AAGCGCAGCG 
ANTCCATTGA ATAACATTGG CTATGTTTTT CCTAATTCNA TGAATACGCT TGATGTTCAA 
ATAGAAGAAC GCTCTGGTCG CTACGGAGAT ATTAACGAAT ACTTTGTTAA TGATAAAACC 
TATACAAATA CATTTGCTAA AATTAGTAAA AATTATGGCA AGACTGTTGA AAATGGTACT 
TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCGCAGCTCT TTCTAAAAAC 
AAAGGCTATA CTGTTCTAGA AAATACAGCA AACTTACAAG CCATTGAAGC AGGTAATTAT 
GTCATGATGA ATACATGGAA TAATGACCAA GAAATTGCAG GACTGTATGC GTATGATCCA 
ATGTCGGTTA TTTCAGAAAA AATTGATAAC GGTGTTTATC GCTTAACTCT TGCGAATCCT 
TTACAAAATA ATGCATCCGT TTCTATTGAA TTTGATAAGG GCATTCTTGA AGTAGTCGCA 
GCGGACCCAG AAATTTCTGT TGACCAAAAT ATTATCACTT TAAATAGTGC GGGGTTAAAT 
GGCAGCTCGC GTTCAATCAT TGTTAAAACA ACTCCTGAAG TAACGAAAGA AGCGTTAGAA 
AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT ACACCGCAAG CAGCTGGAAA 
GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG ATCAAACAAC AGCAACGCAA 
GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT GGTAAAAGTG 
CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA CGAGAAACAC 
CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCATT GAAGCAAGCG 
CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAAGC AGAAGCAAAA 
CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GGGAAAATAA AAAGGAGCAA 
AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA AACTGGTACG 
AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG CTAGCCAATT TTTACCGAGC 
ACAGGAGAAA AGAAATCGAT CGCGCTTGTG ATTATTGGTC TTCTAGTTAT CGCCAGTGGG 
TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA 

EF088-2 (SEQ ID NO:334) 

LVGLANWFRA ALTDTLILLH DDLLNTDAEK LNKFTAPLML YAKDPNIQWP lYRATGANLT 
DISITVLGTG LLLEDNQRLV QVQEAVPSVL KSVSSGDGLY PDGSLIQHGY FPYNGSYGNE 
LLKGFGRIQT ILQGSDWEMN DPNISNLFNV VDKGYLQLMV NGKMPSMVSG RSISRAPETN 
PFTTEFESGK ETIANLTLIA KFAPENLRND lYTSIQTWLQ QSGSYYHFFK KPRDFEALID 
LKN\AmSASP AQATPMQSLN VYGSMDRVLQ KNNEYAVGIS MYSQRVGNYE FGNTENKKGW 
HTADGMLYLY NQDFAQFDEG YWATIDPYRL PGTTVDTREL ANGAYTGKRS PQSWVGGSNN 
GQVASIGMFL DKSNEGMNLV AKKSWFLLDG QIINIiGSGIT GTTDASIETI LDNRMIHPQE 
VKLNQGSDKD NSWISLSAAX PLNNIGYVFP NSMNTLDVQI EERSGRYGDI NEYFVNDKTY 
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TNTFAKISKN YGKTVENGTY EYLTWGKTN EEIAALSKNK GYTVLENTAN LQAIEAGNYV 
MMNTWNNDQE lAGLYAYDPM SVISEKIDNG VYRLTLANPL QNNASVSIEF DKGILEWAA 
DPEISVDQNI ITLNSAGLNG SSRSIIVKTT PEVTKEALEK LIQEQKEHQE KDYTASSWKV 
YSEALKQAQT VADQTTATQA EVDQAETELR SAVKQLVKVP TKEVDKTNLL KIIKENEKHQ 
EKDYTASSWK VYSEALKQAQ TVADQTTATQ AEVDQAEAKL RSAVKRLTLK NSGENKKEQK 
NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQFLPST GEKKSIALVI IGLLVIASGC 
LLVFRKSKSK K 

EF088-3 (SEQ ID NO:335) 

A ACTCCTGAAG TAACGAAAGA AGCGTTAGAA 

AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT ACACCGCAAG CAGCTGGAAA 
GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG ATCAAACAAC AGCAACGCAA 
GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT GGTAAAAGTG 
CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA CGAGAAACAC 
CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCATT GAAGCAAGCG 
CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAAGC AGAAGCAAAA 
CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GGGAAAATAA AAAGGAGCAA 
AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA AACTGGTACG 
AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG CTAGCCAATT TTTACCGAGC 
ACAGGAGAAA AGAAA 



EF088-4 (SEQ ID NO:336) 



T PEVTKEALEK LIQEQKEHQE KDYTASSWKV 

YSEALKQAQT VADQTTATQA EVDQAETELR SAVKQLVKVP TKEVDKTNLL KIIKENEKHQ 
EKDYTASSWK VYSEALKQAQ TVADQTTATQ AEVDQAEAKL RSAVKRLTLK NSGENKKEQK 
NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQFLPST GEKK 



EF089-1 (SEQ ID NO:337) 



TGACAGATAC ACCTGCTAAC 
TATAGGTCAA AAATTTTTTG 
AATGACAGAC ATAGGAGAAT 
TGTATGTTAT TTGGCTGGAT 
ACACCAACAA TTCCCGAAAA 
GCGCCTGGTG CCAAACAAAC 
ACCATTGAAA ATACGGTGAA 
CAAAACGGGA TCAAACCTGA 
CCGAAAGAAA TCATCTTGCC 
CCTAAAGATT CTTTTGATGG 
GAAACAACGA CTTCTGCGGA 
GTTGTGGCTA TTATTCTTCA 
GGGGTTAAAC CAGGCCAAGT 
CAAGCGGCCT ATTTAAACCA 
CTTTACCAAT CCGATACTGA 
ATTTCTTTAA AAGGGGAACG 
GGTGTAAAAG ATGAAAAGGG 
CTGTACAAAT GGGAATTTAC 
AATGAAAAAG ACGTAACCAT 
ATCATTCTAG CGCTGCTCTT 
GAACAACAAT CTGAGCAATA 



ACAGGAAACT AAGAACGACA 
GCTTATCTTT CGGTCTTTTG 
GAATATGAAC AGATGGAAAG 
TGGCGTGGAG GCGCACGCTT 
TCAAGTGGAT AAATCAAAAA 
CGTAGAAATT CAGTTACGCA 
CTCAGCGACA ACAAATTTAA 
CAAAACCTTA CGTTTTAACT 

GAAGCArrrcc caaaagacct 

CGTGATGGCT GGCGGTATAA 
TCAATCAAAA GGGTTAGCTA 
GCAAAATGAG ACAAAGGTTC 
CAACGCGCGA AACGTCATCA 
ATTACATTTA ATCAACACTG 
GGATATGCAA GTGGGGCCAA 
ATTAACGCCA GGAAAATATG 
CACCTATCAA GTCAAAGGCG 
AAAAGAATTT ACTATTTCTG 
TAAAGGAACC AATTGGTGGT 
ATTGATTTTC TTCTTGTATC 
A 



GCATACACGC AAGATCGGGA 
GTGCTTATAA TACAACAAAG 
TATATGCAAC GGTAATCGCT 
CTGAATTTAA TTTTGCGGTC 
CCTACTTTGA CTTAAAAATG 
ATGATACAGA TGAAGACATT 
ATGGCGTAGT AGAATATGGC 
TAAAAGATTA tgtggaagca 
tacctttaac cattacgatg 
cactcaaaga gaaaaagaaa 
ttaataatga atactcctat 
aaccagattt aaaattactg 

ATGTTTCTTT ACAAAACCCA 
TTTCAAAAGG AGGCGAAACG 
ACTCTAACTT TAGTTACCCA 
TCTTGAAATC AACGGCCTAT 
CCAATGGTGA AGAACGGTAC 
GGGACGTCGC TAAAGAATTA 
TGTATCTACT GATTGCATTA 
GTAAAAAGAA AAAAGAGGAA 



EF089-2 (SEQ ID NO:338) 
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MNR WKVYATVIAC 

MLFGWIGVEA HASEFNFAVT PTIPENQVDK SKTYFDLKMA PGAKQTVEIQ LRNDTDEDIT 
lENTVNSATT NLNGWEYGQ NGIKPDKTLR FNLKDYVEAP KEIILPKHSQ KTLPLTIIMP 
KDSFDGVMAG GITLKEKKKE TTTSADQSKG LAINNEYSYV VAIILQQNET KVQPDLKLLG 
VKPGQVNARN VINVSLQNPQ AAYLNQLHLI NTVSKGGETL YQSDTEDMQV APNSNFSYPI 
SLKGERLTPG KYVLKSTAYG VKDEKGTYQV KGANGEERYL YKWEFTKEFT ISGDVAKELN 
EKDVTIKGTN WWLYLLIALI ILALLLLIFF LYRKKKKEEE QQSEQ 

EF089-3 (SEQ ID NO:339) 
T CTGAATTTAA TTTTGCGGTC 

ACACCAACAA TTCCCGAAAA TCAAGTGGAT AAATCAAAAA CCTACTTTGA CTTAAAAATG 
GCGCCTGGTG CCAAACAAAC CGTAGAAATT CAGTTAGGCA ATGATACAGA TGAAGACATT 
ACCATTGAAA ATACGGTGAA CTCAGCGACA ACAAATTTAA ATGGCGTAGT AGAATATCGC 
CAAAACGGGA TCAAACCTGA CAAAACCTTA CGTTTTAACT TAAAAGATTA TGTGGAAGCA 
CCGAAAGAAA TCATCTTGCC GAAGCATTCC CAAAAGACCT TACCTTTAAC CATTACGATG 
CCTAAAGATT CTTTTGATGG CGTGATGGCT GGCGGTATAA CACTCAAAGA GAAAAAGAAA 
GAAACAACGA CTTCTGCGGA TCAATCAAAA GGGTTAGCTA TTAATAATGA ATACTCCTAT 
GTTGTGGCTA TTATTCTTCA GCAAAATGAG ACAAAGGTTC AACCAGATTT AAAATTACTG 
GGGGTTAAAC CAGGCCAAGT CAACGCGCGA AACGTCATCA ATGTTTCTTT ACAAAACCCA 
CAAGCGGCCT ATTTAAACCA ATTACATTTA ATCAACACTG TTTCAAAAGG AGGCGAAACG 
CTTTACCAAT CCGATACTGA GGATATGCAA GTGGCGCCAA ACTCTAACTT TAGTTACCCA 
ATTTCTTTAA AAGGGGAACG AT 

EF089-4 (SEQ ID NO:340) 

SEFNFAVT PTIPENQVDK SKTYFDLKMA PGAKQTVEIQ LRNDTDEDIT 

lENTVNSATT NLNGWEYGQ NGIKPDKTLR FNLKDYVEAP KEIILPKHSQ KTLPLTITMP 

KDSFDGVMAG GITLKEKKKE TTTSADQSKG LAINNEYSYV VAIILQQNET KVQPDLKLLG 

VKPGQVNARN VINVSLQNPQ AAYLNQLHLI NTVSKGGETL YQSDTEDMQV APNSNFSYPI 

SLKGER 

EF090-1 (SEQ ID NO:341) 

TAGTCTCTAA GAAATAAACC TAAAATTATT GATATAAAGG ATGAACAAAT GAAAAAAGAA 
GAAATGCAAA TGCGTAATAC ACGTCGTCAA AAATCAGGAA AAAATAATAA AAA GAAAG TA 
ATTATTACTT CTTTGGTTGG ACTAGCTCTG GTTGCTGGGG GCAGTTATGT TTATTTTCAA 
AGTCACTTTT TNCCAACCAC AAAAGTAAAT GGAGTTTCTG TAGGCTGGTT AAATGTAAAT 
GCTGCAGAAG AAAAATTAGC GCAAGTTAAT CAAACCGAAG AAGTTGTGGT TCAAACGGGG 
ACAAAAGAAG AAAAAATTCA ACTTCCTAAA AAATACCAAT TGGATCAAAA ATTTTTAAAA 
GACCATTTAC ACAGTAGCAA GGTGAAGCTA CCGTTAAACG AGGCATTCAA AAAAGAACTA 
GAAGCCAAAT TAGCAACTTT GAGTTTTCCA GAGGGGAAAC CAAGCAAAAA TGCGAGTATC 
CGTCGAGGCA ATGGCACTTT TGAAATTGTT CCCGAAGAAC AAGGCACAGT AGTGGACACA 
CAGCGCTTAA ACCAGCAGAT TATTGCGGAT GTTGAAGCGG GAAAAGGCAA CTATCAATAT 
AATGCCAAAG ATTTTTATAA AGCCCCTGAA ATTACAAAAG AGGATCAAAC GTTAAAGGCA 
ACATTGACAA CGCTCAATAA CAAGTTAAAT AAAACAATTA CAGTTGATAT TAATGGTGAA 
AAAGTAGCCT TTCATAAAAC ACAAATTCAA AACGTGCTGA ATGATGATGG CACAATCAAC 
AAAGAAAAAC TAACTACTTG GGTGACACAA TTAGAAACAA CATATGGTTC TGCTAATCAA 
CCAGTTTTAT TTACAGATGT TCACGGCACG ACACGTCGTT TTAAAAACAA CGGAAGTTAT 
GGCTGGTCGA TTGATGGGGC CAAAACGCAA GAACTACTAG TAAACGCGCT GAATAGCCAA 
GAACAAACGA ATGCAATCAC TGCTCCGTTG GTTGGTGATA CCAAAGAAAA TAGTAAAATT 
GCCAATAATT ACATTGAAAT TGATTTAAAA GATCAAAAAA TGTATTGTTT CATTGATGGC 
AAAAAAATAG TCACCACAGA TGTCATTACT GGCAGATATA ACAAAGGAAC CGCAACAGTA 
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CCAGGATTCC ATACAATTTT ATATCGGACA ACCGATGTGA ATTTAGAAGG TCAAATGCTT 
GATGGTTCTC GATACAGTGT GCCAGTAAAA TATTGGATGC CGTTATTAAG TCAAGGGGGC 
GTTGTCACAC AAATCGGGAT TCATGACTCC GACCATAAAT TGGATAAGTA TGGCGATAAA 
GAAGCCTTTA AAACCGATGC TGGTAGTAAT GGCTGTATCA ATACGCCAGG AAGAGAAGTT 
TCAAAAATCT TTGATGTATC CTATGACGGA ATGCCGGTAA TTATTTATGG ACATATCTAT 
GATGATGCAC CAGGTGAATT TGATAAACCT GTAGATTACG GCGAAGAAGT ATAA 



EF090-2 (SEQ ID NO:342) 

MRNTRRQK SGKNNKKKVI ITSLVGLALV AGGSYVYFQS 

HFXPTTKVNG VSVGWLNVNA AEEKLAQVNQ TEEWVQTGT KEEKIQLPKK YQLDQKFLKD 
HLHSSKVKLP LNEAFKKELE AKLATLSFPE GKPSKNASIR RGNGTFEIVP EEQGTWDTQ 
RLNQQIIADV EAGKGNYQYN AKDFYKAPEI TKEDQTLKAT LTTLNNKLNK TITVDINGEK 
VAFDKTQIQN VLNDDGTINK EKLTTWVTQL ETTYGSANQP VLFTDVHGTT RRFKNNGSYG 
WSIDGAKTQE LLVNALNSQE QTNAITAPLV GDTKENSKIA NNYIEIDLKD QKMYCFIDGK 
KIVTTDVITG RYNKGTATVP GFHTILYRTT DVNLEGQMLD GSRYSVPVKY WMPLLSQGGV 
VTQIGIHDSD HKLDKYGDKE AFKTDAGSNG CINTPGTEVS KIFDVSYDGM PVIIYGHIYD 
DAPGEFDKPV DYGEEV 



EF090-3 (SEQ ID NO:343) 



CAC AAAAGTAAAT GGAGTTTCTG TAGGCTC 
GCTGCAGAAG AAAAATTAGC GCAAGTTAAT 
ACAAAAGAAG AAAAAATTCA ACTTCCTAAA 
GACCATTTAC ACAGTAGCAA GGTGAAGCTA 
GAAGCCAAAT TAGCAACTTT GAGTTTTCCA 
CGTCGAGGCA ATGGCACTTT TGAAATTGTT 
CAGCGCTTAA ACCAGCAGAT TATTGCGGAT 
AATGCCAAAG ATTTTTATAA AGCCCCTGAA 
ACATTGACAA CGCTCAATAA CAAGTTAAAT 
AAAGTAGCCT TTGATAAAAC ACAAATTCAA 
AAAGAAAAAC TAACTACTTG GGTGACACAA 
CCAGTTTTAT TTACAGATGT TCACGGCACG 
GGCTGGTCGA TTGATGGGGC CAAAACGCAA 
GAACAAACGA ATGCAATCAC TGCTCCGTTG 
GCCAATAATT ACATTGAAAT TGATTTAAAA 
AAAAAAATAG TCACCACAGA TGTCATTACT 
CCAGGATTCC ATACAATTTT ATATCGGACA 
GATGGTTCTC GATACAGTGT GCCAGTAAAA 
GTTGTCACAC AAATCGGGAT TCATGACTCC 
GAAGCCTTTA AAACCGATGC TGGTAGTAAT 
TCAAAAATCT TTGATGTATC CTATGACGGA 
GATGATGCAC CAGGTGAATT TGATAAACCT 

EF090-4 (SEQ ID NO:344) 



JGTT AAATGTAAAT 

CAAACCGAAG AAGTTGTGGT TCAAACGGGG 
AAATACCAAT TGGATCAAAA ATTTTTAAAA 
CCGTTAAACG AGGCATTCAA AAAAGAACTA 
GAGGGGAAAC CAAGCAAAAA TGCGAGTATC 
CCCGAAGAAC AAGGCACAGT AGTGGACACA 
GTTGAAGCGG GAAAAGGCAA CTATCAATAT 
ATTACAAAAG AGGATCAAAC GTTAAAGGCA 
AAAACAATTA CAGTTGATAT TAATGGTGAA 
AACGTGCTGA ATGATGATGG CACAATCAAC 
TTAGAAACAA CATATGGTTC TGCTAATCAA 
ACACGTCGTT TTAAAAACAA CGGAAGTTAT 
GAACTACTAG TAAACGCGCT GAATAGCCAA 
GTTGGTGATA CCAAAGAAAA TAGTAAAATT 
GATCAAAAAA TGTATTGTTT CATTGATGGC 
GGCAGATATA ACAAAGGAAC CGCAACAGTA 
ACCGATGTGA ATTTAGAAGG TCAAATGCTT 
TATTGGATGC CGTTATTAAG TCAAGGGGGC 
GACCATAAAT TGGATAAGTA TGGCGATAAA 
GGCTGTATCA ATACGCCAGG AACAGAAGTT 
ATGCCGGTAA TTATTTATGG ACATATCTAT 
GTAGATTACG GCGAAGAAGT AT 



TKVNG VSVGWLNVNA AEEKLAQVNQ TEEWVQTGT KEEKIQLPKK YQLDQKFLKD 
HLHSSKVKLP LNEAFKKELE AKLATLSFPE GKPSKNASIR RGNGTFEIVP EEQGTWDTQ 
RLNQQIIADV EAGKGNYQYN AKDFYKAPEI TKEDQTLKAT LTTLNNKLNK TITVDINGEK 
VAFDKTQIQN VLNDDGTINK EKLTTWVTQL ETTYGSANQP VLFTDVHGTT RRFKNNGSYG 
WSIDGAKTQE LLVNALNSQE QTNAITAPLV GDTKENSKIA NNYIEIDLKD QKMYCFIDGK 
KIVTTDVITG RYNKGTATVP GFHTILYRTT DVNLEGQMLD GSRYSVPVKY WMPLLSQGGV 
VTQIGIHDSD HKLDKYGDKE AFKTDAGSNG CINTPGTEVS KIFDVSYDGM PVIIYGHIYD 
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DAPGEFDKPV DYGEEV 
EF091-1 (SEQ ID NO:345) 

TAATTGGNGG AGATTTTTAT GGCTAAAAAA GGCGGATTTT TCTTAGGNGC AGTAATTGGT 
GGAACAGCAG CAGCCGTTGC CGCATTATTA CTTGCACCAA AATCAGGTAA AGAATTACGT 
GATGATTTAT CAAATCAAAC AGATGATTTA AAAAACAAAG CGCAAGATTA CACAGATTAT 
GCTGTTCAAA AAGGAACAGA ATTAACAGAA ATCGCAAAAC AAAAAGCCGG CGTTTTATCA 
GATCAAGCCT CTGATTTGGC AGGTTCTGTC AAAGAAAAAA CAAAAGATTC ATTGGATAAA 
GCACAAGGTG TTTCTGGCGA CATGCTTGAT AACTTTAAAA AACAAAGAGG TGATTTATCT 
GATCAATTTA AAAAAGCAGC TGACGATGCT CAAGATCACG CAGAAGATTT AGGTGAAATT 
GCCGAAGATG CAGCAGAAGA TATCTATATT GACGTTAAAG ATTCTGCGGC AGCGGCCAAA 
GAAACTGTTT CTGCTGGTGT CGATGAAGCA ANAGAAACCA CCAAAGATGT TCCTGAAAAA 
GCTGCAGAAG CAAAAGAAGA TGTTAAAGAT GCAGCGAAAG ACGTAAAAAA AGAATTTAAA 
GGGTAA 

EF091-2 (SEQ ID NO:346) 

MAKKG GFFLGAVIGG TAAAVAALLL APKSGKELRD DLSNQTDDLK NKAQDYTDYA 
VQKGTELTEI AKQKAGVLSD QASDLAGSVK EKTKDSLDKA QGVSGDMLDN FKKQTGDLSD 
QFKKAADDAQ DHAEDLGEIA EDAAEDIYID VKDSAAAAKE TVSAGVDEAX ETTKDVPEKA 
AEAKEDVKDA AKDVKKEFKG 

EF091-3 (SEQ ID NO:347) 

AT CAAATCAAAC AGATGATTTA AAAAACAAAG CGCAAGATTA CACAGATTAT 
GCTGTTCAAA AAGGAACAGA ATTAACAGAA ATCGCAAAAC 
GATCAAGCCT CTGATTTGGC AGGTTCTGTC AAAGAAAAAA 
GCACAAGGTG TTTCTGGCGA CATGCTTGAT AACTTTAAAA 
GATCAATTTA AAAAAGCAGC TGACGATGCT CAAGATCACG 
GCCGAAGATG CAGCAGAAGA TATCTATATT GACGTTAAAG 
GAAACTGTTT CTGCTGGTGT CGATGAAGCA ANAGAAACCA 
GCTGCAGAAG CAAAAGAAGA TGTTAAAGAT GCAGCGAAAG 
GGGTAA 

EF091-4 (SEQ ID NO:348) 
SNQTDDLK NKAQDYTDYA 

VQKGTELTEI AKQKAGVLSD QASDLAGSVK EKTKDSLDKA 
QFKKAADDAQ DHAEDLGEIA EDAAEDIYID VKDSAAAAKE 
AEAKEDVKDA AKDVKKEFKG 

EF092-1 (SEQ ID NO:349) 

TAAGGGGATG AAGAAAAAAT GGCAAAAAAA ACAATTATGT 
AGCACGAGTT TATTAGTAAC AAAAATGCAA AAAGCAGCAG 
GACATCTTTG CAGTATCGGC TTCTGAAGCA GATACAAACT 
GTTTTACTTT TAGGTCCACA AGTTCGTTTC ATGAAAGGGC 
CCAAAAGGGA TTCCTTTAGA TGTAATTAAC ATGGCAGATT 
AAAGTTTTAG ATCAAGCAAT CTCATTAATG GGATAA 

EF092-2 (SEQ ID NO:350) 

MAKKT IMLVCSAGMS TSLLVTKMQK AAEDRGMEAD IFAVSASEAD TNLENKEVNV 



AAAAAGCCGG CGTTTTATCA 
CAAAAGATTC ATTGGATAAA 
AACAAACAGG TGATTTATCT 
CAGAAGATTT AGGTGAAATT 
ATTCTGCGGC AGCGGCCAAA 
CCAAAGATGT TCCTGAAAAA 
ACGTAAAAAA AGAATTTAAA 



QGVSGDMLDN FKKQTGDLSD 
TVSAGVDEAX ETTKDVPEKA 



TAGTTTGTTC CGCAGGAATG 
AAGATCGTGG CATGGAAGCA 
TGGAAAATAA AGAGGTGAAT 
AATTTGAACA AAAATTACAA 
ATGGCATGAT GAATGGCGAA 
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LLLGPQVRFM KGQFEQKLQP KGIPLDVINM ADYGMMNGEK VLDQAISLMG 
EF092-3 (SEQ ID NO:351) 
AG AAGATCGTGG CATGGAAGCA 

GACATCTTTG CAGTATCGGC TTCTGAAGCA GATACAAACT TGGAAAATAA AGAGGTGAAT 
GTTTTACTTT TAGGTCCACA AGTTCGTTTC ATGAAAGGGC. AATTTGAACA AAAATTACAA 
CCAAAAGGGA TTCCTTTAGA TGTAATTAAC ATGGCAGATT ATGGCATGAT GAATGGCGAA 
AAAGTTTTAG ATCAAGCAAT CTCATTAATG GGAT 

EF092-4 (SEQ ID NO: 352) 

EDRGMEAD IFAVSASEAD TNLENKEVNV 

LLLGPQVRFM KGQFEQKLQP KGIPLDVINM ADYGMMNGEK VLDQAISLMG 
EF093-1 (SEQ ID NO: 353) 

TAGTTTTTTT CCGATAAAGG GAGAATTTTA ATGAGGCAAA AATATTCAGG AAACTTATTG 
TTCACGGCCA TGGCCATTGT TTATTTGATG AGTTTTCTCG CCCTTCAGTT ACTAGAAGAA 
CGTCAGTTAA CACAAAAATT TACGCAAGCT ACCCAGGAAT ACTATGCAGG GAAAAGTATC 
TTTCATTTAT TTCTTGCAGA TGTTAAACAA AATAGACGAA AGTTAAAAAC AGAAGAAAGG 
CTCGTATACG CGCAAGTGAC CCTCGATTAT ACATACAAAA ATGAACAATT AAGAATAACT 
GTTTTATTAA ACAAATCTGG TCGAAAATAC CAATATCAAG AGAGAGTTTC TCATCAAAAA 
AAAGCGGAAA CAATACTGGA ATAG 

EF093-2 (SEQ ID NO:354) 

M RQKYSGNLLF TAMAIVYLMS FLALQLLEER QLTQKFTQAT QEYYAGKSIF 
HLFLADVKQN RRKLKTEERL VYAQVTLDYT YKNEQLRITV LLNKSGRKYQ YQERVSHQKK 
AETILE 

EF093-3 (SEQ ID NO:355) 
CCTTCAGTT ACTAGAAGAA 

CGTCAGTTAA CACAAAAATT TACGCAAGCT ACCCAGGAAT ACTATGCAGG GAAAAGTATC 
TTTCATTTAT TTCTTGCAGA TGTTAAACAA AATAGACGAA AGTTAAAAAC AGAAGAAAGG 
CTCGTATACG CGCAAGTGAC CCTCGATTAT ACATACAAAA ATGAACAATT AAGAATAACT 
GTTTTATTAA ACAAATCTGG TCGAAAATAC CAATATCAAG AGAGAGTTTC TCATCAAAAA 
AAAGCGGAAA CAATACTGG 

EF093-4 (SEQ ID NO:356) 

LQLLEER QLTQKFTQAT QEYYAGKSIF 

HLFLADVKQN RRKLKTEERL VYAQVTLDYT YKNEQLRITV LLNKSGRKYQ YQERVSHQKK 
AETI 

EF094-1 (SEQ ID NO:357) 

TAAACATTTG AGACATTCAG AGGTGAATGT CTCTTTTTTA TTACTCAAAA ACGAAAGGGG 
ATTAATTATA TGAAAAAAAC AACATTTAAA AATTGGTCGT TATTTGCGAC TTTGGCTCTA 
TTAAGTCAAA CAATTGGCGG AACGATTGGT CCTACGATTG CTTTTGCCGA TGAAATTACT 
CACCCTCAAG AGGTAACAAT TCATTATGAC GTAAGTAAAC TGTATGAAGT TGACGGAACT 
TTTAGCGATG GCAGCACGCT CTCAGAACGT ACTACGTCAT TATATGCAGA ATACAATGGT 
GCAAAACAAA CAGTATTTTG TATTGAACCA GGTGTTAGTA TTCCAACAGA AGTGACGCAC 
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GGTTATCAGA AAAACCCTTT GCCATCAATG TCTGATAAAG CGAAACTAGT ATCGGTTCTT 
TGGGAAAAGG CTGGAACAGA TATTGATACA AATATGGTTG CACAAAAGAT GATTTGGGAA 
GAAGTGAACG GTTATAAACT CCATTCCATA AAAAGATTAG GTGGTGCTTC AGTTGATATA 
AAATCTATTG AAGGAAAAAT TAATAAGGCA ATTGAGGAGT ATCAAAAAAA ACCAAGTTTT 
CATAATACCA CTGTAAAAAC AATTTTAGGT CAATCGACAA CTTTAATAGA TAAAAATGAA 
TTAAATTTAT CTGAGTTTGA TAAAGTCGTC CAAAATACGG CGAATATAGA TTACCGTGTA 
ATTGGGAATC AATTAGTGCT TACTCCAAAC TCTAATTCCA AATCAGGAAC ATTAACATTG 
AAAAAATCAG CTGGTACTGG AACTCCAGTC GCTTATAAAA AAGCAGGACT TCAAACTGTG 
ATGGCTGGTG CGCTTGATAA GCCCAATACC TACGCTATTA AAATTAATGT GGAAACTAAG 
GGTTCTTTAA AGATCAAAAA AATCGATAAA GAATCAGGTG ATATTGTACC AGAAACGGTT 
TTCCATTTAG ATTTl^GGGAA AGCTTTACCT TCAAAAGATG TGACAACAGA TAAAGATGGG 
ATTTCTATTT TGGATGGAAT TCCCCATGGT ACAAAGGTAA CTATTACTQA AAAATCGGTG 
CCAGATCCTT ATATGATTGA TACCACACCC ATGGCTGCCA CCATTAAAGC GGGCGAGACC 
ATTTCCATGA CTTCGAAAAA TATGCGACAA AAAGGTCAAA TTCTTTTAGA GAAGACTGGG 
GTAGAAACAG GTACTGATCT TTGGAATGAC AATTATTCTC TAGCTGGAAA TACATTTGCC 
ATTCGTAAAG ACAGCCCAGC TGGTGAAATT GTCCAAGAAA TAACAACGGA TGAAAAAGGT 
CGTGGGGAAA CACCAAAAGA GCTTGCTAAT GCTTTGGAAC TGGGAACCTA TTACGTGACA 
GAAACTAAAT CTAGTAATGG TTTCGTGAAT ACCTTCAAAC CAACAAAAGT CGAGTTAAAA 
TATGCCAATC AAACCGTGGC TCTTGTTACC AGTAACGTAA AAGGGCAAAA CCAAGAAATT 
ACTGGGGAAA CCACTTTGAC AAAAGAAGAC AAAGATACCG GTAATGAGAG TCAAGGGAAA 
GCTGAGTTTA AAGGAGCTGA ATATACTCTC TTTACTGCAA AAGATGGTCA AGCTGTTAAA 
TGGAGTGAAG CTTTTAAAAC AGAATTAGTG AAGGGAACGA AAGCTTCTGA TGAAACAGTG 
ACTTTGGCTT TAGATGAAAA GAACCAAGTT GCCGTTAAAC ACCTAGCAAT TAACGAGTAT 
TTCTGGCAAG AAACCAAAGC ACCTGAAGGA TATACTTTGG ATGAAACGAA GTATCCTGTA 
TCCATCAAAA AAGTTGATAA TAACGAAAAA AATGCCGTAA TTACTCGAGA TGTTACGGCA 
AAAGAACAAG TTATTCGCTT TGGCTTTGAT TTCTTTAAAT TTGCTGGATC GGCTGATGGC 
ACTGCCGAAA CTGGATTTAA CGACTTATCT TTTAAAGTGT CGCCATTGGA AGGGACCAAN 
GAAATCACAG GTGCTGAAGA TAAAGCGACC ACAGCTTGTA ACGAGCAATT AGGTTTTGAT 
GGCTATGGTA AGTTTGAAAA TCTTCCTTAT GGGGATTATT TACTTGAAGA AATAGAGGCT 
CCAGAAGGAT TTCAAAAGAT TACACCACTA GAAATCCGTT CTACATTTAA GGAAAACAAA 
GACGACTATG CGAAGAGTGA GTATGTCTTT ACCATTACCG AAGAAGGACA AAAACAACCA 
ATTAAGATGG TGACCGTTCC TTACGAGAAA CTAACTAACA ACGAGTTTTC TGTTAGTCTG 
AACCGTTTGA TGCTTTATGA TTTGCCCGAG AAAGAAGATA GTTTGACTTC TCTTGCGACT 
TGGAAAGACG GAAATAAAAA ATTGAATACC CTTGATTTTA CCGAGCTAGT TGATAAATTG 
AGATATAACT TGCATGAAAT CAAAGAAGAC TGGTATGTCG TAGCTCAAGC CATTGATGTG 
GAAGCCACAA AAGCTGCCCA AGAAAAAGAC GAAAAAGCCA AACCGGTGGT GATTGCCGAA 
ACAACCGCAA CGTTGGCGAA CAAAGAGAAA ACTGGAACTT GGAAAATTCT GCATAAATTA 
ACCGCTGAAC AAGTTTTGGA TAAAAGCATC GTCTTGTTCA ATTATGTGTA TGAAAACAAG 
GTAGCCTTTG AAGCAGGCAA TGAGCCAGTA GCGAAGGATG CTAGCTTGAA CAATCAAGCA 
CAAACCGTCA ATTGTACGAT TGAACGCCAT GTTTCCATCC AAACAAAAGC CCACCTAGAA 
GATGGTTCGC AAACTTTTAC TCATGGTGAC GTGATGGATA TGTTTGATGA TGTGTGGGTT 
ACCCATGATG TACTGGATGG CTCAAAAGAA GCTTTCGAAA CAATTCTGTA TGCTTTACTA 
CCAGATGGTA CGAACAAAGA AATTTGGAAA TCTGGCAAAA TTGAGCATGA AGTGAATGAT 
AAAGAATTTA CCAAAACCGT ACTTGCGGAA AAAGTAGATA CCGGAAAGTA TCCAGAAGGA 
ACTAAGTTTA CTTTTACGGA AATCAATTAC GAAAAAGATG GAAACGTGAA TGGAAAACAC 
AATGAAGATT TGAAAGAAAA ATCTCAAACC TTAACACCAA AAGAAGTGCC AACCATACGG 
AGTACGCCAA AACAACCGGA AACACCAGCT GTTCCAAGTA ATTCTCAAGA ATCTAGTCCC 
ACAGTGAAGA CATTCCCGCA AACTGGGGAG AAAAATTCCA ACGTTCTACT GTTAGTTGGC 
TTTATCTTGA TTTTTTCGAC TGCTGGGTAT TATTTCTGGA ATCGCCGCAA TTAA 

EF094-2 (SEQ ID NO:358) 

MKKTTFKN WSLFATLALL SQTIGGTIGP TIAFADEITH 

PQEVTIHYDV SKLYEVDGTF SDGSTLSERT TSLYAEYNGA KQ1VFCIEPG VSIPTEVTHG 
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YQKNPLPSMS DKAKLVSVLW EKAGTDIDTN MVAQKMIWEE VNGYKLHSIK RLGGASVDIK 
SIEGKINKAI EEYQKKPSFH NTTVKTILGQ STTLIDKNEL NLSEFDKWQ NTANIDYRVI 
GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKKAGLQTVM AGALDKPNTY AIKINVETKG 
SLKIKKIDKE SGDIVPETVF HLDFGKALPS KDVTTDKDGI SILDGIPHGT KVTITEKSVP 
DPYMIDTTPM AATIKAGETI SMTSKNMRQK GQILLEKTGV ETGTDLWNDN YSLAGNTFAI 
RKDSPAGEIV QEITTDEKGR AETPKELANA LELGTYYVTE TKSSNGFVNT FKPTKVELKY 
ANQTVALVTS NVKGQNQEIT GETTLTKEDK DTGNESQGKA EFKGAEYTLF TAKDGQAVKW 
SEAFKTELVK GTKASDETVT LALDEKNQVA VKHLAINEYF WQETKAPEGY TLDETKYPVS 
IKKVDNNEKN AVITRDVTAK EQVIRFGFDF FKFAGSADGT AETGFNDLSF KVSPLEGTXE 
ITGAEDKATT ACNEQLGFDG YGKFENLPYG DYLLEEIEAP EGFQKITPLE IRSTFKENKD 
DYAKSEYVPT ITEEGQKQPI KMVTVPYEKL TNNEFSVSLN RLMLYDLPEK EDSLTSLATW 
KDGNKKLNTL DFTELVDKLR YNLHEIKEDW YWAQAIDVE ATKAAQEKDE KAKPWIAET 
TATLANKEKT GTWKILHKLT AEQVLDKSIV LFNYVYENKV AFEAGNEPVA KDASLNNQAQ 
TVNCTIERHV SIQTKAHLED GSQTFTHGDV MDMFDDVSVT HDVLDGSKEA FETILYALLP 
DGTNKEIWKS GKIEHEVNDK EFTKTVLAEK VDTGKYPEGT KFTFTEINYE KDGNVNGKHN 
EDLKEKSQTL TPKEVPTIPS TPKQPETPAV PSNSQESSPT VKTFPQTGEK NSNVLLLVGF 
ILIFSTAGYY FWNRRN 

EF094-3 (SEQ ID NO:359) 

CGA TGAAATTACT 

CACCCTCAAG AGGTAACAAT TCATTATGAC GTAAGTAAAC TGTATGAAGT TGACGGAACT 
TTTAGCGATG GCAGCACGCT CTCAGAACGT ACTACGTCAT TATATGCAGA ATACAATGGT 
GCAAAACAAA CAGTATTTTG TATTGAACCA GGTGTTAGTA TTCCAACAGA AGTGACGCAC 
GGTTATCAGA AAAACCCTTT GCCATCAATG TCTGATAAAG CGAAACTAGT ATCGGTTCTT 
TGGGAAAAGG CTGGAACAGA TATTGATACA AATATGGTTG CACAAAAGAT GATTTGGGAA 
GAAGTGAACG GTTATAAACT CCATTCCATA AAAAGATTAG GTGGTGCTTC AGTTGATATA 
AAATCTATTG AAGGAAAAAT TAATAAGGCA ATTGAGGAGT ATCAAAAAAA ACCAAGTTTT 
CATAATACCA CTGTAAAAAC AATTTTAGGT CAATCGACAA CTTTAATAGA TAAAAATGAA 
TTAAATTTAT CTGAGTTTGA TAAAGTCGTC CAAAATACGG CGAATATAGA TTACCGTGTA 
ATTGGGAATC AATTAGTGCT TACTCCAAAC TCTAATTCCA AATCAGGAAC ATTAACATTG 
AAAAAATCAG CTGGTACTGG AACTCCAGTC GCTTATAAAA AAGCAGGACT TCAAACTGTG 
ATGGCTGGTG CGCTTGATAA GCCCAATACC TACGCTATTA AAATTAATGT GGAAACTAAG 
GGTTCTTTAA AGATCAAAAA AATCGATAAA GAATCAGGTG ATATTGTACC AGAAACGGTT 
TTCCATTTAG ATTTTGGGAA AGCTTTACCT TCAAAAGATG TGACAACAGA TAAAGATGGG 
ATTTCTATTT TGGATGGAAT TCCCCATGGT ACAAAGGTAA CTATTACTGA AAAATC<3GTG 
CCAGATCCTT ATATGATTGA TACCACACCC ATGGCTGCCA CCATTAAAGC GGGCGAGACC 
ATTTCCATGA CTTCGAAAAA TATGCGACAA AAAGGTCAAA TTCTTTTAGA GAAGACTGGG 
GTAGAAACAG GTACTGATCT TTGGAATGAC AATTATTCTC TAGCTGGAAA TACATTTGCC 
ATTCGTAAAG ACAGCCCAGC TGGTGAAATT GTCCAAGAAA TAACAACGGA TGAAAAAGGT 
CGTGCGGAAA CACCAAAAGA GCTTGCTAAT GCTTTGGAAC TGGGAACCTA TTACGTGACA 
GAAACTAAAT CTAGTAATGG TTTCGTGAAT ACCTTCAAAC CAACAAAAGT CGAGTTAAAA 
TATGCCAATC AAACCGTGGC TCTTGTTACC AGTAACGTAA AAGGGCAAAA CCAAGAAATT 
ACTGGGGAAA CCACTTTGAC AAAAGAAGAC AAAGATACCG GTAATGAGAG TCAAGGGAAA 
GCTGAGTTTA AAGGAGCTGA ATATACTCTC TTTACTGCAA AAGATGGTCA AGCTGTTAAA 
TGGAGTGAAG CTTTTAAAAC AGAATTAGTG AAGGGAACGA AAGCTTCTGA TGAAACAG 

EF094-4 (SEQ ID NO:360) 

DEITH 

PQEVTIHYbV SKLYEVDGTF SDGSTLSERT TSLYAEYNGA KQTVFCIEPG 
YQKNPLPSMS DKAKLVSVLW EKAGTDIDTN MVAQKMIWEE VNGYKLHSIK 
SIEGKINKAI EEYQKKPSFH NTTVKTILGQ STTLIDKNEL NLSEFDKWQ 
GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKKAGLQTVM AGALDKPNTY 



VSIPTEVTHG 
RLGGASVDIK 
NTANIDYRVI 
AIKINVETKG 
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SLKIKKIDKE SGDIVPETVF HLDFGKALPS KDVTTDKDGI SILDGIPHGT KVTITEKSVP 
DPYMIDTTPM AATIKAGETI SMTSKNMRQK GQILLEKTGV ETGTDLWNDN YSLAGNTFAI 
RKDSPAGEIV QEITTDEKGR AETPKELANA LELGTYYVTE TKSSNGFVNT FKPTKVELKY 
ANQTVALVTS NVKGQNQEIT GETTLTKEDK DTGNESQGKA EFKGAEYTLF TAKDGQAVKW 
SEAFKTELVK GTKASDET 

EF095-1 {SEQ ID NO:361) 

TAAGAATTGT TGGATTGTTC TTTAGAAAGA AGGGACAATA TGAAGCGAAG TAAATGGAAA 
GAATTGATAG TAACGGGCAT CTGCCATATA TTAGTATTCC CCATACTAAT ACAGACAACT 
GTTTTTGCAG AAACATTACC AAGTACAAAA CAAGTAAGAG AAGGAACCAA TCATTCATTA 
ACAGCAGAAA AAGCCGAAAG TGAACAACCA CAGACAAAGG ATAAACTACA TGATGAAGAA 
ACACTGGCAT TGTCAAAAAG TGAGTTAATC GATAATGAGG CTAATGTTAC AAGTCAAACG 
ATTAGAGAAA GAATTGAGAC GCCTAACCTA ACTTATCGTT ATGGATTTAT TAATGAAGAG 
GGGCAGCCAG TAAACGCCAA TGAGATCCTT CTACAGTATC ATAGTTGGCA AGGCAATTCC 
CCAGATGGCA TAAATGTGTG GGAAGGTGAA AGTCAACCAG TGACAGCATC TACAGTGGCT 
AATTTAAAAG AAGTGGTAAT TCCAAGTGAG AAAGTAGCCG TCTATTCCGA CATGTCAACG 
GTGCTTGCAG CGAGTAATCA AACATTTTTT TTACCAAGAT ATTATACTTC TTTAAGCTTA 
TACAATAAGA AAGGGGAAAT TGATCCCAAT TATCCGCTGC CAACTATTTC CGACGCATCA 
GGAAACCAAT ATCCAACAAC AATTTCGCAA TTTGAATTGG AAAAAATGTC TGCACAACAA 
TATAGTCAGA AAACAGGAGT AACGTTTAAC ATTAGCGAGA GTCAAAAACT AATCGTTCCT 
TTGTACAACC AAGTGAAGGT TGATTCATCG AATCAATCTG GGCTATTGAA TTACTTTAAA 
TTTTCAGGGC CGGTTTATTA TCATGTTACC AATCGCAAAG TGACAGAACA TTTTGTGGAT 
ACTCAAGGGA AACCAATCCC TCCACCACCG GGGTTTAGAC AAGGAAAGCA AACACTTATT 
GAGCGTCACC CTTACACCTT TAAACAGAAA GATCTTTTGC CAAGTAGCTA TGAAATTGAC 
TCAAAAACGT ATCAATTTCA AGGATGGTAT AAAGGGAAAA CGAAACCTGA AAATTTAGAA 
AAAAGCGTAA CGCCCAGTTA TGATATTACC TATGACGACA ATGATGATTT AACTGTTGTC 
TATAAGGAGA TACCTCAAAA AAATTATACA TTTGAGGATG TCAATGGTGT TGAAATTGCA 
CCACCATCTG ATTTTATTCA GGATCACCAA CAACCAATAA CTACGGATGG CTTTCGCTAT 
TTAGCTGGAA AAAAACTGCC ACAACAATAC AGCGTTAACG GTAAAACTTA TTTATATCAA 
GGTTGGTATC AAGATAAAAC NAAACAAGAG AGCTTAGAAA AAACGAAGCG ACCCATAAAC 
TCCCCTCTTT TTAATGAAAT GAACGCTATT ACAGCAGTGT ATAAGGAAAT AACTGCAAAA 
GCTGAAATGC AAATAGAAGG ACTAGTCAAA GTCATGCCAA GTGGTTATAT ACAAATTTGG 
CAGATTATGC TTACAAATGT GGGAGAAGTA CCGTTAAAAA AAATAAACTT AAAGCCAGCA 
AGTGGTTGGT CACCAGGTCT AGCTCGGCCA ATCCAAGTCA CGATTCGTGT TGGATCTGAA 
CCAAACAAAA TTGTTCCTAT TACTGATGAA AATTGGCGAG TTGGCATTAC TTTAAATACG 
GAAGTGCCTA TTGGTCAGAC AGCAACTATT ATGATGACAA CAATTGCTAC AGGTGAACCA 
GATCAAGTGT TACAAGCGGC TGTTGAAATG AATGGAAATT TTTCTGCTGT TCACGCAGCT 
GATACTGTCA GAATCCAACC TAAAAATCAA GAAATTGTGG CACCAGATGA GGAAGGTTTT 
ATCAGCACAC CAACTTTTGA TTTTGGCAAA GTCGCCATTT CTAGCAACAC GCAGCAACAT 
GGTTTAAAGC AGGCAGCAGA TTATTATGAA AATGGTCAGG AAAATCCATA TTTACGTTTG 
AAAAAATCAC AACCCAATTG GGCACTAACT GCAGAACTAT CCCCCTTTGA AGGAAGAGTG 
GATCAACTAT CATCAATGAC AAAGTTATTG TTAGGAACAA CCAATGTTTC AGGTTTTATT 
CAGTACAATC AACCAACGGA AACTAAAGTT GCTCTTGGCA AAACAACCGC TATTCAATTA 
GTTGCCAACG GTGTAGCTAG CCATATTGTT GCCAATGGTC AGTTTGACGA AAGTGATGTT 
TATCAATTTG ATTTTTCTTT TGATCAAATC AAATTAGAAA TTCCAGCAAA TCAAGGTAGA 
AAAGATCAAA CTTATCAAGC AATGGTGACT TGGAATTTAG TGACAGGCCC ATAA 



EF095-2 (SEQ ID NO:362) 

MKRSKWKE LIVTGICHIL VFPILIQTTV FAETLPSTKQ VREGTNHSLT 

AEKAESEQPQ TKDKLHDEET LALSKSELID NEANVTSQTI RERIETPNLT YRYGFINEEG 

QPVNANEILL QYHSWQGNSP DGINVWEGES QPVTASTVAN LKEWIPSEK VAVYSDMSTV 
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TABLE L Nucleotide and Amino Acid Seqeuences of E.faecalis Genes. 

LAASNQTFFL PRYYTSLSLY NKKGEIDPNY PLPTISDASG NQYPTTISQF ELEKMSAQQY 
SQKTGVTFNI SESQKLIVPL YNQVKVDSSN QSGLLNYFKF SGPVYYHVTN RKVTEHFVDT 
QGKPIPPPPG FRQGKQTLIE RDPYTFKQKD LLPSSYEIDS KTYQFQGWYK GKTKPENLEK 
SVTPSYDITY DDNDDLTWY KEIPQKNYTF EDVNGVEIAP PSDFIQDHQQ PITTDGFRYL 
AGKKLPQQYS VNGKTYLYQG WYQDKTKQES LEKTKRPINS PVFNEMNAIT AVYKEITAKA 
EMQIEGLVKV MPSGYIQIWQ IMLTNVGEVP LKKINLKPAS GWSPGLARPI . QVTIRVGSEP 
NKIVPITDEN WRVGITLNTE VPIGQTATIM MTTIATGEPD QVLQAAVEMN GNFSAVHAAD 
TVRIQPKNQE IVAPDEEGFI STPTFDFGKV AISSNTQQHG LKQAADYYEN GQENPYLRLK 
KSQPNWALTA ELSPFEGRVD QLSSMTKLLL GTTNVSGFIQ YNQPTETKVA LGKTTAIQLV 
ANGVASHIVA NGQFDESDVY QFDFSFDQIK LEIPANQGRK DQTYQAMVTW NLVTGP 

EF095-3 (SEQ ID NO:363) 

AAGTACAAAA CAAGTAAGAG AAGGAACCAA TCATTCATTA 

ACAGCAGAAA AAGCCGAAAG TGAACAACCA CAGACAAAGG ATAAACTACA TGATGAAGAA 
ACACTGGCAT TGTCAAAAAG TGAGTTAATC GATAATGAGG CTAATGTTAC AAGTCAAACG 
ATTAGAGAAA GAATTGAGAC GCCTAACCTA ACTTATCGTT ATGGATTTAT TAATGAAGAG 
GGGCAGCCAG TAAACGCCAA TGAGATCCTT CTACAGTATC ATAGTTGGCA AGGCAATTCC 
CCAGATGGCA TAAATGTGTG GGAAGGTGAA AGTCAACCAG TGACAGCATC TACAGTGGCT 
AATTTAAAAG AAGTGGTAAT TCCAAGTGAG AAAGTAGCCG TCTATTCCGA CATGTCAACG 
GTGCTTGCAG CGAGTAATCA AACATTTTTT TTACCAAGAT ATTATACTTC TTTAAGCTTA 
TACAATAAGA AAGGGGAAAT TGATCCCAAT TATCCGCTGC CAACTATTTC CGACGCATCA 
GGAAACCAAT ATCCAACAAC AATTTCGCAA TTTGAATTGG AAAA/^TGTC TGCACAACAA 
TATAGTCAGA AAACAGGAGT AACGTTTAAC ATTAGCGAGA GTCAAAAACT AATCGTTCCT 
TTGTACAACC AAGTGAAGGT TGATTCATCG AATCAATCTG GGCTATTGAA TTACTTTAAA 
TTTTCAGGGC CGGTTTATTA TCATGTTACC AATCGCAAAG TGACAGAACA TTTTGTGGAT 
ACTCAAGGGA AACCAATCCC TCCACCACCG GGGTTTAGAC AAGGAAAGCA AACACTTATT 
GAGCGTGACC CTTACACCTT TAAACAGAAA GATCTTTTGC CAAGTAGCTA TGAAATTGAC 
TCAAAAACGT ATCAATTTCA AGGATGGTAT AAAGGGAAAA CGAAACCTGA AAATTTAGAA 
AAAAGCGTAA CGCCCAGTTA TGATATTACC TATGACGACA ATGATGATTT AACTGTTGTC 
TATAAGGAGA TACCTCAAAA AAATTATACA TTTGAGGATG TCAATGGTGT TGAAATTGCA 
CCACCATCTG ATTTTATTCA GGATCACCAA CAACCAATAA CTACGGATGG CTTTCGCTAT 
TTAGCTGGAA AAAAACTGCC ACAACAATAC AGCGTTAACG GTAAAACTTA TTTATAOXrAA 
GGTTGGTATC AAGATAAAAC NAAACAAGAG AGCTTAGAAA AAACGAAGCG ACCCATAAAC 
TCCCCTGTTT TTAATGAAAT GAACGCTATT ACAGCAGTGT ATAAGGAAAT AACTGCAAAA 
GCTGAAATGC AAATAGAAGG ACTAGTCAAA GTCATGCCAA GTGGTTATAT ACAAATTTGG 
CAGATTATGC TTACAAATGT GGGAGAAGTA CCGOTAAAAA AAATAAACTT AAAGCCAGCA 
AGTGGTTGGT CACCAGGTCT AGCTCGGCCA ATCCAAGTCA CGATTCGTGT TGGATCTCAA 
CCAAACAAAA TTGTTCCTAT TACTGATGAA AATTGGCGAG TTGGCATTAC TTTAAATACG 
GAAGTGCCTA TTGGTCAGAC AGCAACTATT ATGATGACAA CAATTGCTAC AGGTGAACCA 
GATCAAGTGT TACAAGCGGC TGTTGAAATG AATGGAAATT TTTCTGCTGT TCACGCAGCT 
GATACTGTCA GAATCCAACC TAAAAATCAA GAAATTGTGG CACCAGATGA GGAAGGTTTT 
ATCAGCACAC CAACTTTTGA TTTTGGCAAA GTCGCCATTT CTAGCAACAC GCAGCAACAT 
GGTTTAAAGC AGGCAGCAGA TTATTATGAA AATGGTCAGG AAAATCCATA TTTACGTTTG 
AAAAAATCAC AACCCAATTG GGCACTAACT GCAGAACTAT CCCCCTTTGA AGGAAGAGTG 
GATCAACTAT CATCAATGAC AAAGTTATTG TTAGGAACAA CCAATGTTTC AGGTTTTATT 
CAGTACAATC AACCAACGGA AACTAAAGTT GCTCTTGGCA AAACAACCGC TATTCAATTA 
GTTGCCAACG GTGTAGCTAG CCATATTGTT GCCAATGGTC AGTTTGACGA AAGTGATGTT 
TATCAATTTG ATTTTTCTTT TGATCAAATC AAATTAGAAA TTCCAGCAAA TCAAGGTAGA 
AAAGATCAAA CTTATCAAGC AATGGTGACT TGGAATTTAG TGACAGGCCC A 

EF095-4 (SEQ ID NO:364) 

STKQ VREGTNHSLT 
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TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes. 

AEKAESEQPQ TKDKLHDEET LALSKSELID NEANVTSQTI RERIETPNLT YRYGFINEEG 
QPVNANEILL QYHSWQGNSP DGINVWEGES QPVTASTVAN LKEWIPSEK VAVYSDMSTV 
liAASNQTFFL PRYYTSLSLY NKKGEIDPNY PLPTISDASG NQYPTTISQF ELEKMSAQQY 
SQKTGVTFNI SESQKLIVPL YNQVKVDSSN QSGLLNYFKF SGPVYYHVTN RKVTEHFVDT 
QGKPIPPPPG FRQGKQTLIE RDPYTFKQKD LLPSSYEIDS KTYQFQGWYK GKTKPENLEK 
SVTPSYDITY DDNDDLTWY KEIPQKNYTF EDVNGVEIAP PSDFIQDHQQ PITTDGFRYL 
AGKKLPQQYS VNGKTYLYQG WYQDKTKQES LEKTKRPINS PVFNEMNAIT AVYKEITAKA 
EMQIEGLVKV MPSGYIQIWQ IMLTNVGEVP LKKINLKPAS GWSPGLARPI QVTIRVGSEP 
NKIVPITDEN WRVGITLNTE VPIGQTATIM MTTIATGEPD QVLQAAVEMN GNFSAVHAAD 
TVRIQPKNQE IVAPDEEGFI STPTFDFGKV AISSNTQQHG LKQAADYYEN GQENPYLRLK 
KSQPNWALTA ELSPFEGRVD QLSSMTKLLL GTTNVSGFIQ YNQPTETKVA LGKTTAIQLV 
ANGVASHIVA NGQFDESDVY QFDFSFDQIK LEIPANQGRK DQTYQAMVTW NLVTGP 

EF096-1 (SEQ ID NO:365) 

TGAGGTGGCC AAGTTAAAAT GAAAAAATTA CAGTCACTTT TTATTGGAAT TATCGCTATT 
ATTGTCATCT TGTTTTTTGG CGTGCGCCAA TTGGAGAAAG CAAGTGGCAT GGCAGGAGCA 
GATACCTTGA CCATTTACAA TTGGGGGGAC TATATAGATC CGGCCTTGAT TAAGAAATTT 
GAAAAAGAAA CAGGCTATAA AGTCAATTAC GAAACCTTTG ATTCTAATGA AGCTATGTAT 
ACAAAAATTC AGCAAGGTGG CACAGCCTAT GATATTGCCA TTCCTTCTGA ATATATGATT 
CAAAAAATGA TGAAAGCGAA GATGCTTTTA CCACTTGATC ACAGCAAATT AAAAGGCTTA 
GAAAACATTG ATGCACGCTT TTTAGATCAA TCCTTTGATC CCAAAAATAA GTTTTCCGTT 
CCGTACTTCT GGGGCACGTT GGGGATTATT TATAATGATA AATTTATTCA CGGCCGTCAG 
ATCCAACATT GGGATGATTT ATGGCGCCCG GAATTAAAAA ATAATGTCAT GCTGATTGAT 
GGCGCTCGCG AAGTGTTAGG ATTATCTTTG AACAGTTTAG GCTATTCGTT AAACAGTAAA 
AAGGACCAAC AATTACGTCA GGCTACCGAT AAGTTAAACC GATTAACGAA CAATGTCAAA 
GCAATTGTTG CCGATGAAAT CAAAATGTAC ATGGCTAATG AAGAAAGTGC AGTTGCTGTA 
ACTTTCTCTG GTGAAGCTGC TGAAATGCTA GAAAACAATG AACATCTACA TTATGTGATT 
CCCAGTGAAG GCTCTAATCT CTGGTTTGAT AACATTGTGA TGCCTAAGAC AGCCAAAAAT 
AAAGAGGGTG CCTATGCATT TATGAACTTT ATGTTACGAC CAGAAAATGC GGCACAAAAT 
GCAGAATATA TTGGTTATTC CACACCAAAT AAAGAAGCTA AAAAACTATT ACCAAAAGAA 
GTTGCCGAAG ATAAACAATT TTATCCAGAT GATGAAACTA TCAAACATTT AGAAGTTTAC 
CAAGACTTAG GTCAAGAATA CTTAGGAATT TATAACGATC TGTTCTTGGA GTTTAAGATG 
TATCGGAAAT AA 

EF096-2 (SEQ ID NO:366) 

MKKLQ SLFIGIIAII VILFFGVRQL EKASGMAGAD TLTIYNWGDY IDPALIKKFE 
KETGYKVNYE TFDSNEAMYT KIQQGGTAYD lAIPSEYMIQ KMMKAKMLLP LDHSKLKGLE 
NIDARFLDQS FDPKNKFSVP YFWGTLGIIY NDKFIDGRQI QHWDDLWRPE LKNNVMLIDG 
AREVLGLSLN SLGYSLNSKN DQQLRQATDK LNRLTNNVKA IVADEIKMYM ANEESAVAVT 
FSGEAAEMLE NNEHLHYVIP SEGSNLWFDN IVMPKTAKNK EGAYAFMNFM LRPENAAQNA 
EYIGYSTPNK EAKKLLPKEV AEDKQFYPDD ETIKHLEVYQ DLGQEYIiGIY NDLFLEFKMY 
RK 

EF096-3 (SEQ ID NO:367) 
AAGTGGCAT GGCAGGAGCA 

GATACCTTGA CCATTTACAA TTGGGGGGAC TATATAGATC CGGCCTTGAT TAAGAAATTT 
GAAAAAGAAA CAGGCTATAA AGTCAATTAC GAAACCTTTG ATTCTAATGA AGCTATGTAT 
ACAAAAATTC AGCAAGGTGG CACAGCCTAT GATATTGCCA TTCCTTCTGA ATATATGATT 
CAAAAAATGA TGAAAGCGAA GATGCTTTTA CCACTTGATC ACAGCAAATT AAAAGGCTTA 
GAAAACATTG ATGCACGCTT TTTAGATCAA TCCTTTGATC CCAAAAATAA GTTTTCCGTT 
CCGTACTTCT GGGGCACGTT GGGGATTATT TATAATCATA AATTTATTGA CGGCCGTCAG 
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ATCCAACATT GGGATGATTT ATGGCGCCCG 
GGCGCTCGCG AAGTGTTAGG ATTATCTTTG 
AACGACCAAC AATTACGTCA GGCTACCGAT 
GCAATTGTTG CCGATGAAAT CAAAATGTAC 
ACTTTCTCTG GTGAAGCTGC TGAAATGCTA 
CCCAGTGAAG GCTCTAATCT CTGGTTTGAT 
AAAGAGGGTG CCTATGCATT TATGAACTTT 
GCAGAATATA TTGGTTATTC CACACCAAAT 
GTTGCCGAAG ATAAACAATT TTATCCAGAT 
CAAGACTTAG GTCAAGAATA CTTAGGAATT 
TATCGGAAA 



GAATTAAAAA ATAATGTCAT GCTGATTGAT 
AACAGTTTAG GCTATTCGTT AAACAGTAAA 
AAGTTAAACC GATTAACGAA CAATGTCAAA 
ATGGCTAATG AAGAAAGTGC AGTTGCTGTA 
GAAAACAATG AACATCTACA TTATGTGATT 
AACATTGTGA TGCCTAAGAC AGCCAAAAAT 
ATGTTACGAC CAGAAAATGC GGCACAAAAT 
AAAGAAGCTA AAAAACTATT ACCAAAAGAA 
GATGAAACTA TCAAACATTT AGAAGTTTAC 
TATAACGATC TGTTCTTGGA GTTTAAGATG 



EF096-4 (SEQ ID NO:368) 



SGMAGAD TLTIYrJWGDY IDPALIKKFE 
KETGYKVNYE TFDSNEAMYT KIQQGGTAYD 
NIDARFLDQS FDPKNKFSVP YFWGTLGIIY 
AREVLGLSLN SLGYSLNSKN DQQLRQATDK 
FSGEAAEMLE NNEHLHYVIP SEGSNLWFDN 
EYIGYSTPNK EAKKLLPKEV AEDKQFYPDD 
RK 



lAIPSEYMIQ KMMKAKMLLP LDHSKLKGLE 
ISfDKFIDGRQI QHWDDLWRPE LKNNVMLIDG 
LNRLTNNVKA IVADEIKMYM ANEESAVAVT 
IVMPKTAKNK EGAYAFMNFM LRPENAAQNA 
ETIKHLEVYQ DLGQEYLGIY NDLFLEFKMY 



EF097-1 (SEQ ID NO:369) 



TAGAAGTATT CTAATTATCT 
ATGCATTCGC TCTTTTTTAA 
GGTCATCGTT TGAGTGGGAT 
TTGTCTTTGG TGGCTGGCTA 
ACGATAATGA TTCGAGTTGT 
GAGGAACAAC GTGGCGGCGT 
GATGTTCCAC AGTTGTTTGG 
AAAATTGAAC AAATTCTCTT 
TTTTTAGCAG GAATTGTGGG 
GCTGTTGAAA GCGCTAGTTT 
CTTTTACCAT TGGTTCACGT 
ATTAACCATG GCTTATTAAC 
ATTTTATTTC TATTGGAAAC 
CTGTTTGGGC CTGTAGGACA 
GGGGGCATTC ATGAAATTTA 
GTAATTGCTG GAGGAATGAG 
GCTCCAGCTT CGCCAGGTTC 
CTGGCGGTTT TTAGCGGAAT 
TTATTAAAAC GTCAACGAGG 
CAAGTGGAAA CAGTCACACC 
GGCTCAAGTG CCATGGGGGC 
ATGCCTGTGA CTTACCAGTC 
ATTCAAGCAG AATTGAAACA 
GTTCAAAATT TTTTAGAAAT 
TCTTCTCAAG AGCAATCTTC 
ATACAGAAGC TTGTTTTTTT 
GAATTATTGC GGCAACAAGC 
CTGGAAACAG TCTTTTTTAC 
GCCTATCATT TAGATCTAAC 
AAAGAGTATC AAGAATGGCT 



ACATAGAGAG CGAGGGACAA 
ACATAAGTTT GTGAAAGTAA 
GATTATGCCA AATTTGAGTA 
TACGACTGGG AATCTACGGC 
TTTACCGATT CTAATTGGTT 
TGTTGCTGCT ATTGCGACAG 
TGCTATGTTT ATTGGCCCTT 
ACCGAAAGTT AAAGAAGGCT 
AGGACTGCTG TGCTGTTTTG 
TTGGCTGTAT CAATTTTCTT 
TTTCTTAGAG CCCTTAAAAG 
GCCTCTAGGT TTAGAAGGTG 
AAACCCTGGA CCAGGCGTGG 
ACGAAAAACA GCAGGAGGTG 
TTTTCCGTTT GTTTTGATGG 
TGGTACGCTT GTTTTTCAAA 
ATTGGTTGCG ATTTTAGCCA 
TTTTGTTAGC TTTCTGTGCT 
AATTGAACCA GTTTCAATGA 
TAACTATCAG CAAATTTTAT 
TAGTTTGCTA- AGCCGACAAT 
CGTTCATCAG ATGAAGTGGC 
GTTAGCACAA AAGTACGTCC 
TAAATCCTAT TACCCGCAAG 
ACTTGGTTCA GAGTCTACTG 
ATATGCCGAG AATGTTCGAG 
GGCGAAACAA GGAGTCGCGA 
CAAGGAGACA ACCTACGTAG 
GCAACAAAAT TTATACGTAG 
GGAAGGAGGA GCTGATAGAT 



GGAATATGAA GGAAAAAGAA 
CTCCCTATTT ACGTCGTTTT 
TTTTTATTGC GTGGAGCTTA 
TAGCTCTTTC TGAAGTCGAA 
TTACAGGCGG AAAAATGTTC 
TGGGCGTGAT TGTTTCCACA 
TAGCAGGATA TACTTTCGCC 
ACGAGATGCT GACTAAAAAC 
GTATTCTGGT TGTAGCTCCG 
CTTGGTTAAT TGAAGCCAAT 
TGTTATTTTT TAATAATGCG 
CTAGTCAAAC AGGTCAGTCC 
GCGTTTTGGT TGCTTTTCTG 
CCACCATGAT TCAACTGATT 
ACCCGCGCTT ATTTTTAGCA 
TATTTAATGT GGGTCTAAGT 
ATGCCCCGAC TGATGCGAGG 
CTTTTGCAAT AGCAAGCTTG 
TAAAGATGAA GGAGGAAGAC 
TTGTTTGTCA TGCAGGAATG 
TAAAAGCTGT GAACTTGGAG 
AGCCTAAGAC ATTAGTGGTC 
CAGAAAAGGA TATGGTGAGT 
TTTTAGCCAA ACTGACTGCT 
AAACGAACTC GACAAAACAA 
GATCGCAAAC AATGGGAATG 
TTGAAGTATC TAAAGAGCCA 
TGACTCGTGA ACTGGCGCAA 
TTACTAGTTT TTTGAATAAG 
GTTTTTAA 
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EF097-2 (SEQ ID NO:370) 
MLTKNF LAGIVGGLLC CFGILWAPA 

VESASFWLYQ FSSWLIEAJNL LPLVHVFLEP LKVLFFNNAI NHGLLTPLGL 
LFLLETNPGP GVGVLVAFLL FGPVGQRKTA GGATMIQLIG GIHEIYFPFV 
lAGGMSGTLV FQIFNVGLSA PASPGSLVAI LANAPTDARL AVFSGIFVSF 
LKRQRGIEPV SMIKMKEEDQ VETVTPNYQQ ILFVCDAGMG SSAMGASLLS 
PVTYQSVHQM KWQPKTLWI QAELKQLAQK YVPEKDMVSV QNFLEIKSYY 
SQEQSSLGSE STETNSTKQI QKLVFLYAEN VRGSQTMGME LLRQQAAKQG 
ETVFFTKETT YWTRELAQA YHLDLTQQNL YVVTSFLNKK EYQEWLEGGA 

EF097-3 (SEQ ID NO:37l) 

ACGAGG AATTGAACCA GTTTCAATGA TAAAGATGAA GGAGGAAGAC 
CAAGTGGAAA CAGTCACACC TAACTATCAG CAAATTTTAT TTGTTTGTGA TGCAGGAATG 
GGCTCAAGTG CCATGGGGGC TAGTTTGCTA AGCCGACAAT TAAAAGCTGT GAACTTGGAG 
ATGCCTGTGA CTTACCAGTC CGTTCATCAG ATGAAGTGGC AGCCTAAGAC ATTAGTGGTC 
ATTCAAGCAG AATTGAAACA GTTAGCACAA AAGTACGTCC CAGAAAAGGA TATGGTGAGT 
GTTCAAAATT TTTTAGAAAT TAAATCCTAT TACCCGCAAG TTTTAGCCAA ACTGACTGCT 
TCTTCTCAAG AGCAATCTTC ACTTGGTTCA GAGTCTACTG AAACGAACTC GACAAAACAA 
ATACAGAAGC TTGTTTTTTT ATATGCCGAG AATGTTCGAG GATCGCAAAC AATGGGAATG 
GAATTATTGC GGCAACAAGC GGCGAAACAA GGAGTCGCGA TTGAAGTATC TAAAGAGCCA 
CTGGAAACAG TCTTTTTTAC CAAGGAGACA ACCTACGTAG TGACTCGTGA ACTGGCGCAA 
GCCTATCATT TAGATCTAAC GCAACAAAAT TTATACGTAG TTACTAGTTT TTTGAATAAG 
AAAGAGTATC AAGAATGGCT GGAAGGAGGA GCTGATAGAT GTTTTT 

EF097-4 (SEQ ID NO:372) 

RGIEPV SMIKMKEEDQ VETVTPNYQQ ILFVCDAGMG SSAMGASLLS RQLKAVNLEM 
PVTYQSVHQM KWQPKTLWI QAELKQLAQK YVPEKDMVSV QNFLEIKSYY PQVLAKLTAS 
SQEQSSLGSE STETNSTKQI QKLVFLYAEN VRGSQTMGME LLRQQAAKQG VAIEVSKEPL 
ETVFFTKETT YWTRELAQA YHLDLTQQNL YVVTSFLNKK EYQEWLEGGA DRCF 

EF098-1 (SEQ ID NO:373) 

TAAATGAAAA AGACAAAAGT AATGACATTG ATGGCAACCA CAACTTTAGG CGCACTGGCA 
CTTGTACCAA TGAGTGCATT AGCAGTCGAC GGTGGTGAAT ACCAAACAAA CGGAGCGATT 
CAATTTGCAC CAAATACGAA CCCAACGAAT CCAGTTGATC CGACGAATCC AGACCCAGAT 
AAACCAATTA CACCAGTTGA TCCAACTGAT CCGACAGGGC CTAAGCCAGG GACAGCAGGT 
CCGTTATCCA TTGACTATGC ATCTAGCTTA TCTTTTGGGG AACAAACGAT TACCTCAAAA 
AATATGACCT ACTATGCAGA AACACAAAAA TACAAAGATA ACGCTGGTGC CGACCAAGAA 
GGCCCAAACT TTGTTCAAGT CTCAGATAAT CGTGGGACTG AGACAGGTTG GACGCTAAAA 
GTAAAACAAA ATGGTCAATT CAAAACTGAA GCCAACCAAG AACTAACAGC GGCCAAAGTA 
ACTTTAAGCA ACGGACGCGT GGTTTCAGCT TCACAATCTG CAAAGCCAAC GACAGCGCCA 
GCTACGATTG AATTAAACCC AACTGGGGCT GAATCAGTGG TCATGGCTGC TGGCGATAAA 
GAAGGTGCGG GTACGTACTT AATGAGCTGG GGCGATAGTG TAGATACCGC TAAAACAAGT 
ATTTCATTAG AAGTACCTGG TTCAACCACA AAATATGCGA AAAAATACAC GACAACTTTT 
ACTTGGACTT TGACAGATAC ACCTGCTAAC ACAGGAAACT AA 

EF098-2 (SEQ ID NO:374) 

MKKTKVMTLM ATTTLGALAL VPMSALAVDG GEYQTNGAIQ FAPNTNPTNP VDPTNPDPDK 
PITPVDPTDP TGPKPGTAGP LSIDYASSLS FGEQTITSKN MTYYAETQKY KDNAGADQEG 



EGASQTGQSI 
LMDPRLFLAV 
LCSFAIASLL 
RQLKAVNLEM 
PQVLAKLTAS 
VAIEVSKEPL 
DRCF 
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PNFVQVSDNR GTETGWTLKV KQNGQFKTEA NQELTAAKVT LSNGRWSAS QSAKPTTAPA 
TIELNPTGAE SWMAAGDKE GAGTYLMSWG DSVDTAKTSI SLEVPGSTTK YAKKYTTTFT 
WTLTDTPANT GN 

EF098-3 (SEQ ID NO:375) 

AGTCGAC GGTGGTGAAT ACCAAACAAA CGGAGCGATT 

CAATTTGCAC CAAATACGAA CCCAACGAAT CCAGTTGATC CGACGAATCC AGACCCAGAT 
AAACCAATTA CACCAGTTGA TCCAACTGAT CCGACAGGGC CTAAGCCAGG GACAGCAGGT 
CCGTTATCCA TTGACTATGC ATCTAGCTTA TCTTTTGGGG AACAAACGAT TACCTCAAAA 
AATATGACCT ACTATGCAGA AACACAAAAA TACAAAGATA ACGCTGGTGC CGACCAAGAA 
GGCCCAAACT TTGTTCAAGT CTCAGATAAT CGTGGGACTG AGACAGGTTG GACGCTAAAA 
GTAAAACAAA ATGGTCAATT CAAAACTGAA GCCAACCAAG AACTAACAGC GGCCAAAGTA 
ACTTTAAGCA ACGGACGCGT GGTTTCAGCT TCACAATCTG CAAAGCCAAC GACAGCGCCA 
GCTACGATTG AATTAAACCC AACTGGGGCT GAATCAGTGG TCATGGCTGC TGGCGATAAA 
GAAGGTGCGG GTACGTACTT AATGAGCTGG GGCGATAGTG TAGATACCGC TAAAACAAGT 
ATTTCATTAG AAGTACCTGG TTCAACCACA AAATATGCGA AAAAATACAC GACAACTTTT 
ACTTGGACTT TGACAGATAC ACCTGCTAAC ACAGGAAACT 

EF098-4 (SEQ ID NO:376) 

VDG GEYQTNGAIQ FAPNTNPTNP VDPTNPDPDK 

PITPVDPTDP TGPKPGTAGP LSIDYASSLS FGEQTITSKN MTYYAETQKY KDNAGADQEG 
PNFVQVSDNR GTETGWTLKV KQNGQFKTEA NQELTAAKVT LSNGRWSAS QSAKPTTAPA 
TIELNPTGAE SWMAAGDKE GAGTYLMSWG DSVDTAKTSI SLEVPGSTTK YAKKYTTTFT 
WTLTDTPANT GN 

EF099-1 (SEQ ID NO:377) 

TGATGTTGTA GAGGGCTGAT GAAATGTTTA TCAGTCTTCT TTTTATTGAA AGGAGAGATC 
ATGAAGAAAT TAGGCAAGGT TTTAATTGTT AGTTGTTTTA TTTTTATTCT TCCTTTTTTA 
TTATTTTTAG GTGTATTOTrC TTCTAGTGAA AGCGGAGATT CTTCCCAGTT TCAGCCCGCT 
ACACCACAGG AAAAAGTAGC ATTAGAAGTT TCTAACTACG TGACGTCACA TGGCGGAACG 
TTGCAGTTTG CTTCCGCTTG GATTGGCAAT ATGGAACATG AAAGTGGATT AAATCCTGCT 
AGAATTCAAA GTGATTTATC GTTTAATTCA GCGATAGCTT TTAATCCTTC GTTAGGCGGT 
TATGGAATTG GGTTAGGACA ATGGGATTCA GGACGAAGAG TTAATTTATT AAATTTTGCA 
AAAAGTCAAA AAAAGGAATG GAAATCAGTA GCTTTACAAA TGGATTTTGC GTGGAATAAG 
GATGGTTCTG ATAGTGACTT ACTTAAAAGA ATGTCTAAAT CAAAAGATGT GAATACACTT 
GCGGTAGATA TTTTGAAGCT GTGGGAACGA GCTGGAACAA AAGATGATCC CGCAGAACAA 
GTAAAAAGAA AGGCTAGTGC TAATAATTGG TATAAACGAC TTTCTACAGG TTCCATGGGC 
GGAGGTTCAG CCAATGTTGG TGGAGGAAAA ATTGATGCCT TGGAAAAAGT GATGGGGCAA 
ACTATTAATG GTGGTCAATG TTATGGCTTA TCTGCTTTTT TTGTTGAAAA ACAAGGAGGT 
CTACAAATGA TGGGTACGGG GCATATGTTT GCGAGTGAAA TTGGTAATGA TTATCCTTCG 
AGTTCAATTG GTTGGACAGT CATAAAGAAT CCAAATTATT CAGATATTAA AGCAGGAGAT 
GTCATTAATT TTGGTCAAGG TGGTGTGGCT ACTAGTATTT ATGGGCATAC TGGTGTAGTG 
GCAAGTGTTG AAGGTAAAAA CAAGTTTACT ACTTATGAGC AAAACGCTGA ACAAGGTCAA 
ATTGTTGCTA AGTATTTTCG GACTTGGGGA TTAGATTTTC CACATGTGAC CAGCATAGTA 
AGGAAATAG 

EF099-2 (SEQ ID NO:378) 

MKCLS VFFLLKGEIM KKLGKVLIVS CFIFILPFLL FLGVFSSSES GDSSQFQPAT 
PQEKVALEVS NYVTSHGGTL QFASAWIGNM EHESGLNPAR IQSDLSFNSA lAFNPSLGGY 
GIGLGQWDSG RRVNLLNFAK SQKKEWKSVA LQMDFAWNKD GSDSDLLKRM SKSKDVNTLA 
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VDILKLWERA GTKDDPAEQV KRKASANNWY KRLSTGSMGG GSANVGC3GKI DALEKVMGQT 
INGGQCYGLS AFFVEKQGGL QMMGTGHMFA SEIGNDYPWS SIGWTVIKNP NYSDIKAGDV 
INFGQGGVAT SIYGHTGWA SVEGKNKFTT YEQNAEQGQI VAKYFRTWGL DFPHVTSIVR 

K 

EF099-3 (SEQ ID NO:379) 

TAGTGAA AGCGGAGATT CTTCCCAGTT TCAGCCCGCT 

ACACCACAGG AAAAAGTAGC ATTAGAAGTT TCTAACTACG TGACGTCACA TGGCGGAACG 
TTGCAGTTTG CTTCCGCTTG GATTGGCAAT ATGGAACATG AAAGTGGATT AAATCCTGCT 
AGAATTCAAA GTGATTTATC GTTTAATTCA GCGATAGCTT TTAATCCTTC GTTAGGCGGT 
TATGGAATTG GGTTAGGACA ATGGGATTCA GGACGAAGAG TTAATTTATT AAATTTTGCA 
AAAAGTCAAA AAAAGGAATG GAAATCAGTA GCTTTACAAA TGGATTTTGC GTGGAATAAG 
GATGGTTCTG ATAGTGACTT ACTTAAAAGA ATGTCTAAAT CAAAAGATGT GAATACACTT 
GCGGTAGATA TTTTGAAGCT GTGGGAACGA GCTGGAACAA AAGATGATCC CGCAGAACAA 
GTAAAAAGAA AGGCTAGTGC TAATAATTGG TATAAACGAC TTTCTACAGG TTCCATGGGC 
GGAGGTTCAG CCAATGTTGG TGGAGGAAAA ATTGATGCCT TGGAAAAAGT GATGGGGCAA 
ACTATTAATG GTGGTCAATG TTATGGCTTA TCTGCTTTTT TTGTTGAAAA ACAAGGAGGT 
CTACAAATGA TGGGTACGGG GCATATGTTT GCGAGTGAAA TTGGTAATGA TTATCCTTGG 
AGTTCAATTG GTTGGACAGT CATAAAGAAT CCAAATTATT CAGATATTAA AGCAGGAGAT 
GTCATTAATT TTGGTCAAGG TGGTGTGGCT ACTAGTATTT ATGGGCATAC TGGTGTAGTG 
GCAAGTGTTG AAGGTAAAAA CAAGTTTACT ACTTATGAGC AAAACGCTGA ACAAGGTCAA 
ATTGTTGCTA AGTATTTTCG GACTTGGGGA TTAGATTTTC CACATGTGAC CAGCATAGTA 
AGGAAAT 

EF099-4 (SEQ ID NO;380) 
SES GDSSQFQPAT 

PQEKVALEVS NYVTSHGGTL QFASAWIGNM EHESGLNPAR IQSDLSFNSA lAFNPSLGGY 
GIGLGQWDSG RRVNLLNFAK SQKKEWKSVA LQMDFAWNKD GSDSDLLKRM SKSKDVNTLA 
VDILKLWERA GTKDDPAEQV KRKASANNWY KRLSTGSMGG GSANVGGGKI DALEKVMGQT 
INGGQCYGLS AFFVEKQGGL QMMGTGHMFA SEIGNDYPWS SIGWTVIKNP NYSDIKAGDV 
INFGQGGVAT SIYGHTGWA SVEGKNKFTT YEQNAEQGQI VAKYFRTWGL DFPHVTSIVR 
K 

EFlOO-1 (SEQ ID NO:381) 

TANTTATGGC AATATGGAAG GAGTTTTATA ATGAAAAAGA AACAAAAATA CGCAGGGTTT 
ACATTATTAG AAATGTTGAT TGTCTTATTG ATTATTTCCG TATTGATTTT ACTTTTTGTC 
CCTAACTTAG CGAAACATAA AGAAACAGTT GATAAAAAAG GCAATGAAGC AATCGTAAAA 
ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGAGGCC TTCCTTAAAT 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC AGCAGAAAAG 
CAATGA 

EFlOO-2 (SEQ ID NO: 382) 

MKKKQKYAGF TLLEMLIVLL IISVLILLFV PNLAKHKETV DKKGNEAIVK 
IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 

EFlOO-3 (SEQ ID NO: 383) 

TAA AGAAACAGTT GATAAAAAAG GCAATGAAGC AATCGTAAAA 

ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC TTCCTTAAAT 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC AGCAGAAAAG 
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CAAT 

EFlOO-4 (SEQ ID NO:384) 
KETV DKKGNEAIVK 

IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 
EFlOO-1 {SEQ ID NO:385) 

TANTTATGGC AATATGGAAG GAGTTTTATA ATGAAAAAGA AACAAAAATA CGCAGGGTTT 
ACATTATTAG AAATGTTGAT TGTCTTATTG ATTATTTCCG TATTGATTTT ACTTTTTGTC 
CCTAACTTAG CGAAACATAA AGAAACAGTT GATAAAAAAG GCAATGAAGC AATCGTAAAA 
ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC TTCCTTAAAT 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC AGCAGAAAAG 
CAATGA 

EFlOO-2 {SEQ ID NO:386) 

MKKKQKYAGF TLLEMLIVLL IISVLILLFV PNLAKHKETV DKKGNEAIVK 
IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 

EFlOO-3 {SEQ ID NO:387) 

TAA AGAAACAGTT GATAAAAAAG GCAATGAAGC AATCGTAAAA 
ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC 
CAAT 

EFlOO-4 (SEQ ID NO:388) 
KETV DKKGNEAIVK 

IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 

EFlOl-1 {SEQ ID NO:389) 

TGAGGAGATG AAACGAAGAA AATGAAGAAG AAAACGATAA TTATATTGGG GGCAGTTGCG 
GTAATTGCGG TTGGGGGCAT CGTAACTGTG AATGCGTTAA ATAAAAATGC ACAACAAGTA 
GCTGTCAAGC AAGCGCCTAA AGATGACTGG GGAATTGACT ATTTTGACGT TCCCGACTTG 
CAACAAATTT ATATTAACGG TGTCATCCAA CCGGAACAAA TCGAAGCCTT TGCGCGTGAT 
CAAAAAATAA CAAAGGATCC AGAGATTAAG GTGAAAAACG GCGATGTCGT AGATCCAGGC 
ACAGAATTAT TTACTTATGA AGATGAGGCG GTCACAAAAG AAATTGAGGC ACAACAAAAT 
AGCTTAGCCA AATTAGAAAC GAAGCGGGCG AATATCTATA ATAAGTGGAA TCGGGCCATT 
GATAAATTTA ATAAAACTAA AGAAGAAGAC CGCACGATGT CTGGTGATGA TTTAAATGAA 
CAATATCAAA CAGAAGTCGA TGCAGTAGAT GAAGAGATTA CCTTCACCAA TGAAACCTTA 
GCGGATTTAG GAGCGAAGCA ATATATTTCC ACAAAGGCTA ATTTCAAAGG TCGTCTATCA 
ATTCCAGAAG TAAAAGATGC CAATTCACCG ATTTTACGGT TAACTTCAGA AGATCTTTAT 
TTAGCTGGAA AAGTGAATGA AAAGGACTTG ACTAAAATTA GTGTTGGGCA AAAAGCTAAA 
CTAACTTCTG TTTCCAACAA TGTGGTTGTG GATGGCTCAA TTTCTTACAT CGATGATAAT 
CCTCCTGAAG GCAACAGCGA TGCCGCGAGT GGCAATCCAG AGGGCGGCAC AACGATGTCT 
AGTTATAGCG TCAAAATTGC GTTGGCCAAT TTAGACAAAG TCAAAAATGG CTACCATATG 
CAAGCAACCA TTGATTTAGG CGATTTAGGG GCGATTGAGT TACCGAAAAA AGCGATTCAA 
AAAGAGGGTG AACAGGCCTA CGTTTTAGTG AATGATTTTG GAACCATCAT TCGTCGTGAT 
GTCCAAGTCG GGCAAGAAAA TGGCGACAAA ATGGCGATTG AATCTGGCTT AGAATCAGCC 
GACCGAGTGG TTATTTCTTC AAAAAAACCA GTAAAAGTCG GTGATATTGT TGAATCAGAT 



TTCCTTAAAT 
AGCAGAAAAG 
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GCAGCGATTG CTTCTGATGA ATCAGCAACC AACGAATCAA TGACAGATGC GTCGAAATAG 
EFlOl-2 (SEQ ID NO:390) 

MKKK TIIILGAVAV lAVGGIVTVN ALNKNAQQVA VKQAPKDDWG IDYFDVPDLQ 
QIYINGVIQP EQMEAFARDQ KITKDPEIKV KNGDWDAGT ELFTYEDEAV TKEIEAQQNS 
LAKLETKRAN lYNKWNRAID KFNKTKEEDR TMSGDDLNEQ YQTEVDAVDE EITFTNETLA 
DLGAKQYIST KANFKGRVSI PEVKDANSPI LRLTSEDLYL AGKVNEKDLT KISVGQKAKL 
TSVSNNVWD GSISYIDDNP PEGNSDAASG NPEGGTTMSS YSVKIALANL DKVKNGYHMQ 
ATIDLGDLGA lELPKKAIQK EGEQAYVLVN DFGTIIRRDV QVGQENGDKM AIESGLESAD 
RWISSKKPV KVGDIVESDA AIASDESATN ESMTDASK 

EFlOl-3 (SEQ ID NO:391) 

TAAAAATGC ACAACAAGTA 

GCTGTCAAGC AAGCGCCTAA AGATGACTGG GGAATTGACT ATTTTGACGT TCCCGACTTG 
CAACAAATTT ATATTAACGG TGTCATCCAA CCGGAACAAA TGGAAGCCTT TGCGGGTGAT 
CAAAAAATAA CAAAGGATCC AGAGATTAAG GTGAAAAACG GCGATGTCGT AGATGCAGGC 
ACAGAATTAT TTACTTATGA AGATGAGGCG GTCACAAAAG AAATTGAGGC ACAACAAAAT 
AGCTTAGCCA AATTAGAAAC GAAGCGGGCG AATATCTATA ATAAGTGGAA TCGGGCCATT 
GATAAATTTA ATAAAACTAA AGAAGAAGAC CGCACGATGT CTGGTGATGA TTTAAATGAA 
CAATATCAAA CAGAAGTCGA TGCAGTAGAT GAAGAGATTA CCTTCACCAA TGAAACCTTA 
GCGGATTTAG GAGCGAAGCA ATATATTTCC ACAAAGGCTA ATTTCAAAGG TCGTGTATCA 
ATTCCAGAAG -TAAAAGATGC CAATTCACCG ATTTTACGGT TAACTTCAGA AGATCTTTAT 
TTAGCTGGAA AAGTGAATGA AAAGGACTTG ACTAAAATTA GTGTTGGGCA AAAAGCTAAA 
CTAACTTCTG TTTCCAACAA TGTGGTTGTG GATGGCTCAA TTTCTTACAT CGATGATAAT 
CCTCCTGAAG GCAACAGCGA TGCCGCGAGT GGCAATCCAG AGGGCGGCAC AACGATGTCT 
AGTTATAGCG TCAAAATTGC GTTGGCCAAT TTAGACAAAG TCAAAAATGG CTACCATATG 
CAAGCAACCA TTGATTTAGG CGATTTAGGG GCGATTGAGT TACCGAAAAA AGCGATTCAA 
AAAGAGGGTG AACAGGCCTA CGTTTTAGTG AATGATTTTG GAACCATCAT TCGTCGTGAT 
GTCCAAGTCG GGCAAGAAAA TGGCGACAAA ATGGCGATTG AATCTGGCTT AGAATCAGCC 
GACCGAGTGG TTATTTCTTC AAAAAAACCA GTAAAAGTCG GTGATATTGT TGAATCAGAT 
GCAGCGATTG CTTCTGATGA ATCAGCAACC AACGAATCAA TGACAGATGC GTCGAAAT 



EFlOl-4 (SEQ ID NO:392) 

KNAQQVA VKQAPKDDWG IDYFDVPDLQ 
QIYINGVIQP EQMEAFARDQ KITKDPEIKV 
LAKLETKRAN lYNKWNRAID KFNKTKEEDR 
DLGAKQYIST KANFKGRVSI PEVKDANSPI 
TSVSNNVWD GSISYIDDNP PEGNSDAASG 
ATIDLGDLGA lELPKKAIQK EGEQAYVLVN 
RWISSKKPV KVGDIVESDA AIASDESATN 



KNGDWDAGT ELFTYEDEAV TKEIEAQQNS 
TMSGDDLNEQ YQTEVDAVDE EITFTNETLA 
LRLTSEDLYL AGKVNEKDLT KISVGQKAKL 
NPEGGTTMSS YSVKIALANL DKVKNGYHMQ 
DFGTIIRRDV QVGQENGDKM AIESGLESAD 
ESMTDASK 



EF102-1 (SEQ ID NO:393) 

TAAACATTTG AGACATTCAG AGGTGAATGT CTCTTTTTTA TTACTCAAAA ACGAAAGGGG 
ATTAATTATA TGAAAAAAAC AACATTTAAA AATTGGTCGT TATTTGCGAC TTTGGCTCTA 
TTAAGTCAAA CAATTGGCGG AACGATTGGT CCTACGATTG CTTTTGCCGA TGAAATTACT 
CACCCTCAAG AGGTAACAAT TCATTATGAC GTAAGTAAAC TGTATGAAGT TGACGGAACT 
TTTAGCGATG GCAGCACGCT CTCAGAACGT ACTACGTCAT TATATGCAGA ATACAATGGT 
GCAAAACAAA CAGTATTTTG TATTGAACCA GGTGTTAGTA TTCCAACAGA AGTGACGCAC 
GGTTATCAGA AAAACCCTTT GCCATCAATG TCTGATAAAG CGAAACTAGT ATCGGTTCTT 
TGGGAAAAGG CTGGAACAGA TATTGATACA AATATGGTTG CACAAAAGAT GATTTGGGAA 
GAAGTGAACG GTTATAAACT CCATTCCATA AAAAGATTAG GTGGTGCTTC AGTTGATATA 
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AAATCTATTG AAGGAAAAAT TAATAAGGCA ATTGAGGAGT ATCAAAAAAA ACCAAGTTTT 
CATAATACCA CTGTAAAAAC AATTTTAGGT CAATCGACAA CTTTAATAGA TAAAAATGAA 
TTAAATTTAT CTGAGTTTGA TAAAGTCGTC CAAAATACGG CGAATATAGA TTACCGTGTA 
ATTGGGAATC AATTAGTGCT TACTCCAAAC TCTAATTCCA AATCAGGAAC ATTAACATTG 
AAAAAATCAG CTGGTACTGG AACTCCAGTC GCTTATAAAA AAGCAGGACT TCAAACTGTG 
ATGGCTGGTG CGCTTGATAA GCCCAATACC TACGCTATTA AAATTAATGT GGAAACTAAG 
GGTTCTTTAA AGATCAAAAA AATCGATAAA GAATCAGGTG ATATTGTACC AGAAACGGTT 
TTCCATTTAG ATTTTGGGAA AGCTTTACCT TCAAAAGATG TGACAACAGA TAAAGATGGG 
ATTTCTATTT TGGATGGAAT TCCCCATGGT ACAAAGGTAA CTATTACTGA AAAATCGGTG 
CCAGATCCTT ATATGATTGA TACCACACCC ATGGCTGCCA CCATTAAAGC GGGCGAGACC 
ATTTCCATGA CTTCGAAAAA TATGCGACAA AAAGGTCAAA TTCTTTTAGA GAAGACTGGG 
GTAGAAACAG GTACTGATCT TTGGAATGAC AATTATTCTC TAGCTGGAAA TACATTTGCC 
ATTCGTAAAG ACAGCCCAGC TGGTGAAATT GTCCAAGAAA TAACAACGGA TGAAAAAGGT 
CGTGCGGAAA CACCAAAAGA GCTTGCTAAT GCTTTGGAAC TGGGAACCTA TTACGTGACA 
GAAACTAAAT CTAGTAATGG TTTCGTGAAT ACCTTCAAAC CAACAAAAGT CGAGTTAAAA 
TATGCCAATC AAACCGTGGC TCTTGTTACC AGTAACGTAA AAGGGCAAAA CCAAGAAATT 
ACTGGGGAAA CCACTTTGAC AAAAGAAGAC AAAGATACCG GTAATGAGAG TCAAGGGAAA 
GCTGAGTTTA AAGGAGCTGA ATATACTCTC TTTACTGCAA AAGATGGTCA AGCTGTTAAA 
TGGAGTGAAG CTTTTAAAAC AGAATTAGTG AAGGGAACGA AAGCTTCTGA TGAAACAGTG 
ACTTTGGCTT TAGATGAAAA GAACCAAGTT GCCGTTAAAC ACCTAGCAAT TAACGAGTAT 
TTCTGGCAAG AAACCAAAGC ACCTGAAGGA TATACTTTGG ATGAAACGAA GTATCCTGTA 
TCCATCAAAA AAGTTGATAA TAACGAAAAA AATGCCGTAA TTACTCGAGA TGTTACGGCA 
AAAGAACAAG TTATTCGCTT TGGCTTTGAT TTCTTTAAAT TTGCTGGATC GGCTGATGGC 
ACTGCCGAAA CTGGATTTAA CGACTTATCT TTTAAAGTGT CGCCATTGGA AGGGACCAAN 
GAAATCACAG GTGCTGAAGA TAAAGCGACC ACAGCTTGTA ACGAGCAATT AGGTTTTGAT 
GGCTATGGTA AGTTTGAAAA TCTTCCTTAT GGGGATTATT TACTTGAAGA AATAGAGGCT 
CCAGAAGGAT TTCAAAAGAT TACACCACTA GAAATCCGTT CTACATTTAA GGAAAACAAA 
GACGACTATG CGAAGAGTGA GTATGTCTTT ACCATTACCG AAGAAGGACA AAAACAACCA 
ATTAAGATGG TGACCGTTCC TTACGAGAAA CTAACTAACA ACGAGTTTTC TGTTAGTCTG 
AACCGTTTGA TGCTTTATGA TTTGCCCGAG AAAGAAGATA GTTTGACTTC TCTTGCGACT 
TGGAAAGACG GAT^TAAAAA ATTGAATACC CTTGATTTTA CCGAGCTAGT TGATAAATTG 
AGATATAACT TGCATGAAAT CAAAGAAGAC TGGTATGTCG TAGCTCAAGC CATTGATGTG 
GAAGCCACAA AAGCTGCCCA AGAAAAAGAC GAAAAAGCCA AACCGGTGGT GATTGCCGAA 
ACAACCGCAA CGTTGGCGAA CAAAGAGAAA ACTGGAACTT GGAAAATTCT GCATAAATTA 
ACCGCTGAAC AAGTTTTGGA TAAAAGCATC GTCTTGTTCA ATTATGTGTA TGAAAACAAG 
GTAGCCTTTG AAGCAGGCAA TGAGCCAGTA GCGAAGGATG CTAGCTTGAA CAATCAAGCA 
CAAACCGTCA ATTGTACGAT TGAACGCCAT GTTTCCATCC AAACAAAAGC CCACCTAGAA 
GATGGTTCGC AAACTTTTAC TCATGGTGAC GTGATGGATA TGTTTGATGA TGTGTCGGTT 
ACCCATGATG TACTGGATGG CTCAAAAGAA GCTTTCGAAA CAATTCTGTA TGCTTTACTA 
CCAGATGGTA CGAACAAAGA AATTTGGAAA TCTGGCAAAA TTGAGCATGA AGTGAATGAT 
AAAGAATTTA CCAAAACCGT ACTTGCGGAA AAAGTAGATA CCGGAAAGTA TCCAGAAGGA 
ACTAAGTTTA CTTTTACGGA AATCAATTAC GAAAAAGAOXS GAAACGTGAA TGGAAAACAC 
AATGAAGATT TGAAAGAAAA ATCTCAAACC TTAACACCAA AAGAAGTGCC AACCATACCG 
AGTACGCCAA AACAACCGGA AACACCAGCT GTTCCAAGTA ATTCTCAAGA ATCTAGTCCC 
ACAGTGAAGA CATTCCCGCA AACTGGGGAG AAAAATTCCA ACGTTCTACT GTTAGTTGGC 
TTTATCTTGA TTTTTTCGAC TGCTGGGTAT TATTTCTGGA ATCGCCGCAA TTAA 

EF102-2 (SEQ ID NO:394) 

MKKTTFKN WSLFATLALL SQTIGGTIGP TIAFADEITH 

PQEVTIHYDV SKLYEVDGTF SDGSTLSERT TSLYAEYNGA KQTVFCIEPG VSIPTEVTHG 
YQKNPLPSMS DKAKLVSVLW EKAGTDIE>TN MVAQKMIWEE VNGYKLHSIK RU3GASVDIK 
SIEGKINKAI EEYQKKPSFH NTTVKTILGQ STTLIDKNEL NLSEFDKWQ NTANIDYRVI 
GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKKAGLQTVM AGALDKPNTY AIKINVETKG 
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SLKIKKIDKE SGDIVPETVF HLDFGKALPS KDVTTDKDGI SILDGIPHGT KVTITEKSVP 
DPYMIDTTPM AATIKAGETI SMTSKNMRQK GQILLEKTGV ETGTDLWNDN YSLAGNTFAI 
RKDSPAGEIV QEITTDEKGR AETPKELANA LELGTYYVTE TKSSNGFVNT FKPTKVELKY 
ANQTVALVTS NVKGQNQEIT GETTLTKEDK DTGNESQGKA EFKGAEYTLF TAKDGQAVKW 
SEAFKTELVK GTKASDETVT LALDEKNQVA VKHLAINEYF WQETKAPEGY TLDETKYPVS 
IKKVDNNEKN AVITRDVTAK EQVIRFGFDF FKFAGSADGT AETGFNDLSF KVSPLEGTXE 
ITGAEDKATT ACNEQLGFDG YGKFENLPYG DYLLEEIEAP EGFQKITPLE IRSTFKENKD 
DYAKSEYVFT ITEEGQKQPI KMVTVPYEKL TNNEFSVSLN RLMLYDLPEK EDSLTSLATW 
KDGNKKLNTL DFTELVDKLR YNLHEIKEDW YWAQAIDVE ATKAAQEKDE KAKPWIAET 
TATLANKEKT GTWKILHKLT AEQVLDKSIV LFNYVYENKV AFEAGNEPVA KDASLNNQAQ 
TVNCTIERHV SIQTKAHLED GSQTFTHGDV MDMFDDVSVT HDVLDGSKEA FETILYALLP 
DGTNKEIWKS GKIEHEVNDK EFTKTVLAEK VDTGKYPEGT KFTFTEINYE KDGNVNGKHN 
EDLKEKSQTL TPKEVPTIPS TPKQPETPAV PSNSQESSPT VKTFPQTGEK NSNVLLLVGF 
ILIFSTAGYY FWNRRN 

EF102-3 (SEQ ID NO:395) 



TT TAGATGAAAA GAACCAAGTT GCCGTTAJ 
TTCTGGCAAG AAACCAAAGC ACCTGAAGGA 
TCCATCAAAA AAGTTGATAA TAACGAAAAA 
AAAGAACAAG TTATTCGCTT TGGCTTTGAT 
ACTGCCGAAA CTGGATTTAA CGACTTATCT 
GAAATCACAG GTGCTGAAGA TAAAGCGACC 
GGCTATGGTA AGTTTGAAAA TCTTCCTTAT 
CCAGAAGGAT TTCAAAAGAT TACACCACTA 
GACGACTATG CGAAGAGTGA GTATGTCTTT 
ATTAAGATGG TGACCGTTCC TTACGAGAAA 
AACCGTTTGA TGCTTTATGA TTTGCCCGAG 
TGGAAAGACG GAAATAAAAA ATTGAATACC 
AGATATAACT TGCATGAAAT CAAAGAAGAC 
GAAGCCACAA AAGCTGCCCA AGAAAAAGAC 
ACAACCGCAA CGTTGGCGAA CAAAGAGAAA 
ACCGCTGAAC AAGTTTTGGA TAAAAGCATC 
GTAGCCTTTG AAGCAGGCAA TGAGCCAGTA 
CAAACCGTCA ATTGTACGAT TGAACGCCAT 
GATGGTTCGC AAACTTTTAC TCATGGTGAC 
ACCCATGATG TACTGGATGG CTCAAAAGAA 
CCAGATGGTA CGAACAAAGA AATTTGGAAA 
AAAGAATTTA CCAAAACCGT ACTTGCGGAA 
ACTAAGTTTA CTTTTACGGA AATCAATTAC 
AATGAAGATT TGAAAGAAAA ATCTCAAACC 
AGTACGCCAA AACAACCGGA AACACCAGCT 
ACAGTGAAGA 



lAC ACCTAGCAAT TAACGAGTAT 
TATACTTTGG ATGAAACGAA GTATCCTGTA 
AATGCCGTAA TTACTCGAGA TGTTACGGCA 
TTCTTTAAAT TTGCTGGATC GGCTGATGGC 
TTTAAAGTGT CGCCATTGGA AGGGACCAAN 
ACAGCTTGTA ACGAGCAATT AGGTTTTGAT 
GGGGATTATT TACTTGAAGA AATAGAGGCT 
GAAATCCGTT CTACATTTAA GGAAAACAAA 
ACCATTACCG AAGAAGGACA AAAACAACCA 
CTAACTAACA ACGAGTTTTC TGTTAGTCTG 
AAAGAAGATA GTTTGACTTC TCTTGCGACT 
CTTGATTTTA CCGAGCTAGT TGATAAATTG 
TGGTATGTCG TAGCTCAAGC CATTGATGTG 
GAAAAAGCCA AACCGGTGGT GATTGCCGAA 
ACTGGAACTT GGAAAATTCT GCATAAATTA 
GTCTTGTTCA ATTATGTGTA TGAAAACAAG 
GCGAAGGATG CTAGCTTGAA CAATCAAGCA 
GTTTCCATCC AAACAAAAGC CCACCTAGAA 
GTGATGGATA TGTTTGATGA TGTGTCGGTT 
GCTTTCGAAA CAATTCTGTA TGCTTTACTA 
TCTGGCAAAA TTGAGCATGA AGTGAATGAT 
AAAGTAGATA CCGGAAAGTA TCCAGAAGGA 
GAAAAAGATG GAAACGTGAA TGGAAAACAC 
TTAACACCAA AAGAAGTGCC AACCATACCG 
GTTCCAAGTA ATTCTCAAGA ATCTAGTCCC 



EF102-4 (SEQ ID NO:396) 



LDEKNQVA VKHLAINEYF WQETKAPEGY T] 
IKKVDNNEKN AVITRDVTAK EQVIRFGFDF 
ITGAEDKATT ACNEQLGFDG YGKFENLPYG 
DYAKSEYVFT ITEEGQKQPI KMVTVPYEKL 
KDGNKKLNTL DFTELVDKLR YNLHEIKEDW 
TATLANKEKT GTWKILHKLT AEQVLDKSIV 
TVNCTIERHV SIQTKAHLED GSQTFTHGDV 
DGTNKEIWKS GKIEHEVNDK EFTKTVLAEK 



FKFAGSADGT AETGFNDLSF KVSPLEGTXE 
DYLLEEIEAP EGFQKITPLE IRSTFKENKD 
TNNEFSVSLN RLMLYDLPEK EDSLTSLATW 
YWAQAIDVE ATKAAQEKDE KAKPWIAET 
LFNYVYENKV AFEAGNEPVA KDASLNNQAQ 
MDMFDDVSVT HDVLDGSKEA FETILYALLP 
VDTGKYPEGT KFTFTEINYE KDGNVNGKHN 
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EDLKEKSQTL TPKEVPTIPS TPKQPETPAV PSNSQESSPT VK 
EF103-1 (SEQ ID NO:397) 

TAAGATAGGT TTATCAAAGA AAAGGAGCGA TGCTTTATGA AAAAGAAAGT ATTAAGTTCG 
ATTACTTTAG TAACATTAAG TACGTTACTT ATAGCAGGTT ATGCAAGTCC AGCATTTGCA 
GATCATGCAG CCAATCCAAA TAGTGCTACA GCAAATTTAG GCAAACATCA AAACAATGGC 
CAAACAAGAG GCGACAAGGC GACTAAGATT TTATCTGGCA CGGACTGGCA AGGAACCCGT 
GTTTATGATG CTGCTGGTAA TGATTTAACG GCAGAAAATG CTAATTTTAT TGGTTTAGCA 
AAATATGATG GTGAAACCGG TTTTTACGAG TTTTTCGACA AAAATACTGG GGAAACCCGT 
GGTGACGAAG GAACATTTTT TGTGACAGGT GATGGCACAA AACGAATTTT AATTTCGCGG 
ACACAAAATT ATCAAGCCGT AGTGGATTTA ACCGAAGTGA GTAAAGACNA ATTTACTTAC 
AAGCGTTTAG GGAAAGATAA ACTGGGGAAT GATGTTGAAG TTTACGTGGA ACACATCCCT 
TATCATGGGA AAAAATTAGC TTTTACAAAT GGACGTGAAG CATTAACCAA TCAAACTGGC 
AAAATTGTGA CAAATAAATC AGGGGATAAA ATTTTAGGAA CAACCTTGTG <3AATGGCACA 
AAAGTCGTAG ATAAAAACGG TAATGATGTG ACAGCGGCCA ATCAAAATTT CATTAGTTTA 
GCGAAATTTG ATCCAAACAC AAGTAAATAT GAATTTTTCA ATTTACAAAC AGGTGAAACC 
CGCGGCGACT TTGGGTACTT CCAAGTGGTG GACAATAACA AGATTCGGGC CCATGTATCT 
ATTGGTACGA ATCGTTACGG CGCGGCGCTA GAATTAACGG AACTAAACAA TGATCGATTT 
ACGTATACTC GAATGGGTAA AGATAATGCT GGTAATGATA TTCAAGTGTT CGTGGAACAT 
GAACCTTACC AAGGCACATA TCATCCAGCC TTTACTTTCT AA 



EF103-2 (SEQ ID NO:398) 



MKKKVLSSI TLVTLSTLLI AGYASPAFAD HAANPNSATA NLGKHQNNGQ 
TRGDKATKIL SGTDWQGTRV YDAAGNDLTA ENANFIGLAK YDGETGFYEF FDKNTGETRG 
DEGTFFVTGD GTKRILISRT QNYQAWDLT EVSKDXFTYK RLGKDKIiGND VEVYVEHIPY 
HGKKLAFTNG REALTNQTCK IVTNKSGDKI LGTTLWNGTK WDKNGNDVT AANQNFISLA 
KFDPNTSKYE FFNLQTGETR GDFGYFQWD NNKIRAHVSI GTNRYGAALE LTELNNDRFT 
YTRMGKDNAG NDIQVFVEHE PYQGTYHPAF TF 



EF103-3 (SEQ ID NO:399) 



TCATGCAG CCAATCCAAA TAGTGCTACA G< 
CAAACAAGAG GCGACAAGGC GACTAAGATT 
GTTTATGATG CTGCTGGTAA TGATTTAACG 
AAATATGATG GTGAAACCGG TTTTTACGAG 
GGTGACGAAG GAACATTTTT TGTGACAGGT 
ACACAAAATT ATCAAGCCGT AGTGGATTTA 
AAGCGTTTAG GGAAAGATAA ACTGGGGAAT 
TATCATGGGA AAAAATTAGC TTTTACAAAT 
AAAATTGTGA CAAATAAATC AGGGGATAAA 
AAAGTCGTAG ATAAAAACGG TAATGATGTG 
GCGAAATTTG ATCCAAACAC AAGTAAATAT 
CGCGGCGACT TTGGGTACTT CCAAGTGGTG 
ATTGGTACGA ATCGTTACGG CGCGGCGCTA 
ACGTATACTC GAATGGGTAA AGATAATGCT 
GAACCTTACC AAGGCACATA TCATCCAGCC 

EF103-4 (SEQ ID NO:400) 



:AAATTTAG GCAAACATCA AAACAATGGC 
TTATCTGGCA GGGACTCGCA AGGAACCCGT 
GCAGAAAATG CTAATTTTAT TGGTTTAGCA 
TTTTTCGACA AAAATACTGG GGAAACCCGT 
GATGGCACAA AACGAATTTT AATTTCGCGG 
ACCGAAGTGA GTAAAGACNA ATTTACTTAC 
GATGTTGAAG TTTACGTGGA ACACATCCCT 
GGACGTGAAG CATTAACCAA TCAAACTGGC 
ATTTTAGGAA CAACCTTGTG GAATGGCACA 
ACAGCGGCCA ATCAAAATTT CATTAGTTTA 
GAATTTTTCA ATTTACAAAC AGGTGAAACC 
GACAATAACA AGATTCGGGC CCATGTATCT 
GAATTAACGG AACTAAACAA TGATCGATTT 
GGTAATGATA TTCAAGTGTT CGTGGAACAT 
T 



HAANPNSATA NLGKHQNNGQ 

TRGDKATKIL SGTDWQGTRV YDAAGNDLTA ENANFIGLAK YDGETGFYEF FDKNTCETRG 
DEGTFFVTGD GTKRILISRT QNYQAWDLT EVSKDXFTYK RLGKDKLGND VEVYVEHIPY 
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HGKKLAFTNG REALTNQTGK IVTNKSGDKI LGTTLWNGTK WDKNGNDVT AANQNFISLA 
KFDPNTSKYE FFNLQTGETR GDFGYFQWD NNKIRAHVSI GTNRYGAALE LTELNNDRFT 
YTRMGKDNAG NDIQVFVEHE PYQGTYHPA 

EF104-1 (SEQ ID NO:401) 

TGAAAGGGGA TTAGTATGAA GAAAAAAACT TTTTCTTTTG TGATGTTGAG TATACTTCTC 
GCACAAAATT TCGGGTTTGC CGTAAATGCC TATGCTGTAA CAACGACAGA AGCACAAACA 
GAGACCACTG ATACAGCAAA AAAAGAGGCA GAGTTATCGA ACTCAACACC ATCTTTACCT 
TTAGCAACAA CGACTACTTC AGAAATGAAT CAACCAACTG CAACAACTGA ATCGCAAACC 
ACAGAGGCGA GCACAACAGC TTCCAGTGAT GCTGCTACAC CATCTGAACA ACAAACAACG 
GAGGACAAGG ACACCTCACT TAATGAAAAA GCCCTGCCAG ATGTTCAAGC GCCAATTACA 
GATGAACTAC TTGACAGTAT GAGTCTTGCG CCGATTGGTG GAACAGAATA CAGCCAAACA 
GAGGTTCACC GCGAATTAAA TACAACACCG GTAACCGCTA CGTTCCAATT TGCTGTTGGA 
AACACAGGTT ATGCACCTGG ATCAGTTTAT ACAGTTCAAT TACCAGAACA TTTAGGTTAT 
TCAACTGTCA GCGGAGAAGT GACAGGCATT GGCGCAACTT GGGCAGTCGA TGCGGCGACC 
AAAACATTAA GTATTACGTT TAATCAACGA GTTTCAGATA CTTCCTTTAA AGTAGAACTA 
AAAAGTTATC TAACAACAGA GGCGGAACCA TTAATCAAAA TTGAAACTCC AGGAAAAAAT 
AAAAAAACCT ACTCGTTTGA TTTATATGAA CAAGTGGAAC CAATTCAATA TAACGAACGA 
ACCAGAACGA CGGGGTTAGA TGGCGAAATT TTTTATAATT TAGACCGGAC GTTAACTGGC 
AATCAAACAT TAGAATTATT AACAACAGAG ACGCCAGGCG CTGTCTTTGG AAAACAAGAT 
AACTTGGAAC CTCAAGTTTT CAGTTACGAT GTCGACATTA ATGGTCAAAT TTTACCAGAA 
ACGCAAACCT TGTTAACACC TGGCAAAGAT TATACATTAA GCGATAATTC ACTCGGGCGG 
ATTGCTGTAA CTGTTCCAAA CATGAATCAA CAAAAAGCCT ATTCCTTATC GATTAATCGG 
ACAATTTATT TAGAGAGTGC TTCGGACTAT AACTACTTAT ATTCGCAGCA GTATCCAACA 
ACAAAAATTG GGTCAATTTC TTTGAAAAGT ACGACAGGAA CTAAACAAAC AACCGATTTT 
ACTGCTAAGA CGAGTCAAAC AAGTAAAGTA ATTGCTGATC GTGAAATGCG TAGTATGTCC 
TATATCAGTT TTCAAAGCAA AGGGAAATAT TATGTAACAA TTTATGGCAC GTTAACAGAA 
ACAAAAGTGG GTCAACAAAT CGTATTAGAG AGTACAAACG GTCAAGAAAT TAAGAATCCT 
AAATTTACGG CGTATGGTCC TTTATATGAA AATGTAAAAT TGGAAGACTA TTTTGATATT 
AAAACTGAAG GTGGCAAGCT CACTTTAACG GCCACAAAAG ATAGCTATTT AAGAATAAAT 
ATTTCTGATT TAACAATGGA TTTTGACAAG AAGGACATTA ATCTATCATT AAGTACACCT 
GTAATTGGTC CTAATAAAGC CATTCAATTA GTATCCGATC AATATATTGA ACCAATTAGT 
GTTGTTAATC CTTTGAATGC TGAAACTGCT TGGGGTAATT ATGATCAAAA TGGTGCCTAT 
TCATCAAGAA CAACTGTCTC AGTTATGGGA AGCAAAGAGA AACCGATTCA AAATTTAGAA 
ATTAAAGTAA AGCATCCTAA TTATCTTTCA TTACGAGCTA CAAAAGAAAT TTATTTTTAT 
TACAAGTTAG GAACGGATTA TACAGTAACG CCAACGTCAG ATGGTTCAGT TATTAAGTTC 
ACTACGCCAA TAACGAACGA AATCCAAATT CCAATTGGTT TTAATTATGT GCCAGATAGT 
TTGCCAAAAG ATAAAAGTAT CCCAGTCGAT ACGATACCGA TAACAATGAG TGCTGAAGGT 
TTAACTCCAG TTGATACGAC AGTAACTACT AATAGTAAGC GTGGTTCTGA ACGAACACTT 
CAAAGTAGTA AAAATCAATT CCTTGTCAAT GCACGAAATG ATTCTTTTGA CTCACTAAGC 
GTCCGTACAA AAATTCCAGC TGGCGCCGAT GTTCTTTTTG ACATTTATGA TGTTTCAAAC 
GATCAGGTAG ATTCAATTTA TCCACAATAC TGGGACCGCG GTCAATACTT TGATAAACCA 
ATGACGCCAA ACAGCCCTGG ATATCCAACG ATTACTTTTG ACGAAAATAC CAATAGTTAC 
ACGTTTGATT TTGGAAAAAC CAACAAACGT TACATTATTG AGTATAAAAA CGCCAATGGC 
TGGATCGACG TGCCAACTCT TTATATAACA GGGACAGCGA AAGAACCACA ATCGAATAAT 
AATGAAGGCT CTGCTTCGGT TTCTGTTCAA AATGAAGCGT TAGACATTTT GAGTGCAACA 
CAAGCGGCGA ATCCAACATT AAAAAATGTA ACAAAAACGA CAGTAACAAC AAAAAATATT 
GATAATAAAA CACATCGTGT GAAAAATCCA ACGATTGAAT TAACACCAAA AGGCACAACC 
AATGCTCAAA TCGATTTGAA TTCTATTACC GTGAAAGGCG TGCCAGAAGA TGCTTATTCA 
TTAGAGAAGA CTACAAACGG TGCGAAAGTC ATTTTTAAAG ACTATACATT GACAGAAAAC 
ATTACGATTG AATACAATAC GGTCTCTGCA AACGCTGGCC AAATCTATAC AGAAACAACA 
ATCGACTCTG AAACATTGAA CCAGATGTCT GCTAGCAAGA AAAAAGTCAC CACTGCGCCA 
ATCACATTGA AATTCTCAGA AGGTGATGCG GAAGGTATTG TTTATTTAGC AACTGCCACA 
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TTCTACACGC ATAACGTAGA GGATGAAAAC CAAGCAATTG CGAAGGTTTC TTTTGAACTA 
ATTGATAATG TCACGCATAC AGCAACCGAA TTTACAACAG ATGAAAAAGG TCAATACTCC 
TTTGATGCCA TCATGACAGG TGATTATACT TTGCGAGTAA CGAATGTACC GCAGGAATAT 
TCCGTGGATG AAGAGTATTT GACAGGAAAA GCCATTAAGC TGGTCAAAGG AGACAACCAA 
CTAAAAATTC CATTAACGAA AACAATTGAT CACAGTCGTT TACAAGTCAA AGATTCAACG 
ATTTATGTCG GCGATTCATG GAAACCAGAA GAGAACTTTG TTTCAGCAAC AGATAAAACA 
GGTCAAGACG TTCCCTTCGA AAAAATCACT GTTTCAGGTC AAGTTGATAA CANCAAAGCA 
GGCGTTTATC CAATTATTTA CAGTGACGAA GGTAAAGAAG AAACAGCCTA TGTGACCGTC 
AAACCCGACC AATCTAAGTT AGAGGTCAAA GATACAACGA TTTATGTTGG TGATTCGTGG 
AAACCAGAAG ATAATTTCGT TTCAGCGACA GACAAAACAG GTCAAGACGT NCCGTTTGAA 
AAAATTGATG TTCAGGGAAC AGTGAATGTT GATAAAATAG GCGATTATGA AATTGTCTAT 
AAAAATGGCA NAAAAGAAGC GAAAGCAATC GTTCATGTCC GTGATGACAG TCAGTTAGAG 
GTTAAAGATA CAACGATTTA TGTTGGTGAT TCGTGGAAAC CAGAAGATAA TTTCGTTTCA 
GCAACAGACA AAACAGGCCA AGACGTTCCG TTTGAAAAAA TCACTGTTTC AGGTCAAGTT 
GATACTAGCA AAGCAGGCGT TTATCCAATC GTTTACAGTT ACGAAGGTAA AGAAGAAACA 
GCTAATGTGA CTGTCAAACC CGACCAATCT AAGTTAGAGG TTAAAGATAC AACGATTTAT 
GTGGGCGATA AATGGGAACC AGAAGATAAT TTCGTTTCAG CAACAGACAA AACAGGTCAA 
GATGTCCCGT TTGAAAAAAT TGACGTTCAG GGAACAGTGA ATGTTGATAA AATAGGCGAT 
TATGAAATTG TCTATAAAAA TGGCACAAAA GAAGCGAAAG CAATCGTTCA TGTCCGTGAT 
GACAGTCAGT TAGAGGTCAA AGATACAACA ATTTATGTGG GTGATAAATG GGAAGCAGAA 
GATAACTTCG TTTCCGCGAC AGACAAAACA GGTCAAGACG TTCCGTTTGA AAAAATTGAT 
GTTCAGGGAA CAGTGAATGT TGATAAAATA GGCGATTATG AAATTGTCTA TAAAAATGGC 
ACAAAAGAAG CGAAAGCAAT CGTTCATGTC CGTGATGATA GTCGTTTACA AGTCAAGGAT 
ACAACGATTT ATGTCGGCGA TTCNTGGANA CCAGAAGNGA ACTTTGTTTC AGCNACAGAT 
AAAACAGGTC AAGATGTCCC ATTCGAAAAA ATCACTGTT 

EF104-2 (SEQ ID NO:402) 

MKKKTF SFVMLSILLA QNFGFAVNAY AVTTTEAQTE TTDTAKKEAE LSNSTPSLPL 
ATTTTSEMNQ PTATTESQTT EASTTASSDA ATPSEQQTTE DKDTSLNEKA LPDVQAPITD 
ELLDSMSLAP IGGTEYSQTE VHRELNTTPV TATFQFAVGN TGYAPGSVYT VQLPEHLGYS 
TVSGEVTGIG ATWAVDAATK TLSITFNQRV SDTSFKVELK SYLTTEAEPL IKIETPGKNK 
KTYSFDLYEQ VEPIQYNERT RTTGLDGEIF YNLDRTLTGN QTLELLTTET PGAVFGKQDN 
LEPQVFSYDV DINGQILPET QTLLTPGKDY TLSDNSLGRI AVTVPNMNQQ KAYSLSINRT 
lYLESASDYN YLYSQQYPTT KIGSISLKST TGTKQTTDFT AKTSQTSKVI ADREMRSMSY 
ISFQSKGKYY VTIYGTLTET KVGQQIVLES TNGQEIKNPK FTAYGPLYEN VKLEDYPDIK 
TEGGKLTLTA TKDSYLRINI SDLTMDFDKK DINLSLSTPV IGPNKAIQLV SDQYIEPISV 
VNPLNAETAW GNYDQNGAYS SRTTVSVMGS KEKPIQNLEI KVKHPNYLSL RATKEIYFYY 
KLGTDYTVTP TSDGSVIKFT TPITNEIQIP IGFNYVPDSL PKDKSIPVDT IPITMSAEGL 
TPVDTTVTTN SKRGSERTLQ SSKNQFLVNA RNDSFDSLSV RTKIPAGADV LFDIYDVSND 
QVDSIYPQYW DRGQYFDKPM TPNSPGYPTI TFDENTNSYT FDFGKTNKRY IIEYKNANGW 
IDVPTLYITG TAKEPQSNNN EGSASVSVQN EALDILSATQ AANPTLKNVT KTTVTTKNID- 
NKTHRVKNPT lELTPKGTTN AQIDLNSITV KGVPEDAYSL EKTTNGAKVI FKDYTLTENI 
TIEYNTVSAN AGQIYTETTI DSETLNQMSA SKKKVTTAPI TLKFSEGDAE GIVYLATATF 
YTHNVEDENQ AIAKVSFELI DNVTHTATEF TTDEKGQYSF DAIMTGDYTL RVTNVPQEYS 
VDEEYLTGKA IKLVKGDNQL KIPLTKTIDH SRLQVKDSTI YVGDSWKPEE NFVSATDKTG 
QDVPFEKITV SGQVDNXKAG VYPIIYSDEG KEETAYVTVK PDQSKLEVKD TTIYVGDSWK 
PEDNFVSATD KTGQDVPFEK IDVQGTVNVD KIGDYEIVYK NGXKEAKAIV HVRDDSQLEV 
KDTTIYVGDS WKPEDNFVSA TDKTGQDVPF EKITVSGQVD TSKAGVYPIV YSYEGKEETA 
NVTVKPDQSK LEVKDTTIYV GDKWEPEDNF VSATDKTGQD VPFEKIDVQG TVNVDKIGDY 
EIVYKNGTKE AKAIVHVRDD SQLEVKDTTI YVGDKWEAED NFVSATDKTG QDVPFEKIDV 
QGTVNVDKIG DYEIVYKNGT KEAKAIVHVR DDSRLQVKDT TIYVGDSWXP EXNFVSATDK 
TGQDVPFEKI TV 
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EF104-3 (SEQ ID NO:403) 



TGTAA CAACGACAGA AGCACAAACA 
GAGACCACTG ATACAGCAAA AAAAGAGGCA 
TTAGCAACAA CGACTACTTC AGAAATGAAT 
ACAGAGGCGA GCACAACAGC TTCCAGTGAT 
GAGGACAAGG ACACCTCACT TAATGAAAAA 
GATGAACTAC TTGACAGTAT GAGTCTTGCG 
GAGGTTCACC GCGAATTAAA TACAACACCG 
AACACAGGTT ATGCACCTGG ATCAGTTTAT 
TCAACTGTCA GCGGAGAAGT GACAGGCATT 
AAAACATTAA GTATTACGTT TAATCAACGA 
AAAAGTTATC TAACAACAGA GGCGGAACCA 
AAAAAAACCT ACTCGTTTGA TTTATATGAA 
ACCAGAACGA CGGGGTTAGA TGGCGAAATT 
AATCAAACAT TAGAATTATT AACAACAGAG 
AACTTGGAAC CTCAAGTTTT CAGTTACGAT 
ACGCAAACCT TGTTAACACC TGGCAAAGAT 
ATTGCTGTAA CTGTTCCAAA CATGAATCAA 
ACAATTTATT TAGAGAGTGC TTCGGACTAT 
ACAAAAATTG GGTCAATTTC TTTGAAAAGT 
ACTGCTAAGA CGAGTCAAAC AAGTAAAGTA 
TATATCAGTT TTCAAAGCAA AGGGAAATAT 
ACAAAAGTGG GTCAACAAAT CGTATTAGAG 
AAATTTACGG CGTATGGTCC TTTATATGAA 
AAAACTGAAG GTGGCAAGCT CACTTTAACG 
ATTTCTGATT TAACAATGGA TTTTGACAAG 
GTAATTGGTC CTAATAAAGC CATTCAATTA 
GTTGTTAATC CTTTGAATGC TGAAACTGCT 
TCATCAAGAA CAACTGTCTC AGTTATGGGA 
ATTAAAGTAA AGCATCCTAA TTATCTTTCA 
TACAAGTTAG GAACGGATTA TACAGTAACG 
ACTACGCCAA TAACCAACGA AATCCAAATT 
TTGCCAAAAG ATAAAAGTAT CCCAGTCGAT 
TTAACTCCAG TTGATACGAC AGTAACTACT 
CAAAGTAGTA AAAATCAATT CCTTGTCAAT 
GTCCGTACAA AAATTCCAGC TGGCGCCGAT 
GATCAGGTAG ATTCAATTTA TCCACAATAC 
ATGACGCCAA ACAGCCCTGG ATATCCAACG 
ACGTTTGATT TTGGAAAAAC CAACAAACGT 
TGGATCGACG TGCCAACTCT TTATATAACA 
AATGAAGGCT CTGCTTCGGT TTCTGTTCAA 
CAAGCGGCGA ATCCAACATT AAAAAATGTA 
GATAATAAAA CACATCGTGT GAAAAATCCA 
AATGCTCAAA TCGATTTGAA TTCTATTACC 
TTAGAGAAGA CTACAAACGG TGCGAAAGTC 
ATTACGATTG AATACAATAC GGTCTCTGCA 
ATCGACTCTG AAACATTGAA CCAGATGTCT 
ATCACATTGA AATTCTCAGA AGGTGATGCG 
TTCTACACGC ATAACGTAGA GGATGAAAAC 
ATTGATAATG TCACGCATAC AGCAACCGAA 
TTTGATGCCA TCATGACAGG TGATTATACT 
TCCGTGGATG AAGAGTATTT GACAGGAAAA 
CTAAAAATTC CATTAACGAA AACAATTGAT 



GAGTTATCGA ACTCAACACC ATCTTTACCT 
CAACCAACTG CAACAACTGA ATCGCAAACC 
GCTGCTACAC CATCTGAACA ACAAACAACG 
GCCCTGCCAG ATGTTCAAGC GCCAATTACA 
CCGATTGGTG GAACAGAATA CAGCCAAACA 
GTAACCGCTA CGTTCCAATT TGCTGTTGGA 
ACAGTTCAAT TACCAGAACA TTTAGGTTAT 
GGCGCAACTT GGGCAGTCGA TGCGGCGACC 
GTTTCAGATA CTTCCTTTAA AGTAGAACTA 
TTAATCAAAA TTGAAACTCC AGGAAAAAAT 
CAAGTGGAAC CAATTCAATA TAACGAACGA 
TTTTATAATT TAGACCGGAC GTTAACTGGC 
ACGCCAGGCG CTGTCTTTGG AAAACAAGAT 
GTCGACATTA ATGGTCAAAT TTTACCAGAA 
TATACATTAA GCGATAATTC ACTCGGGCGG 
CAAAAAGCCT ATTCCTTATC GATTAATCGG 
AACTACTTAT ATTCGCAGCA GTATCCAACA 
ACGACAGGAA CTAAACAAAC AACCGATTTT 
ATTGCTGATC GTGAAATGCG TAGTATGTCC 
TATGTAACAA TTTATGGCAC GTTAACAGAA 
AGTACAAACG GTCAAGAAAT TAAGAATCCT 
AATGTAAAAT TGGAAGACTA TTTTGATATT 
GCCACAAAAG ATAGCTATTT AAGAATAAAT 
AAGGACATTA ATCTATCATT AAGTACACCT 
GTATCCGATC AATATATTGA ACCAATTAGT 
TGGGGTAATT ATGATCAAAA TGGTGCCTAT 
AGCAAAGAGA AACCGATTCA AAATTTAGAA 
TTACGAGCTA CAAAAGAAAT TTATTTTTAT 
CCAACGTCAG ATGGTTCAGT TATTAAGITC 
CCAATTGGTT TTAATTATGT GCCAGATAGT 
ACGATACCGA TAACAATGAG TGCTGAAGGT 
AATAGTAAGC GTGGTTCTGA ACGAACACTT 
GCACGAAATG ATTCTTTTGA CTCACTAAGC 
GTTCTTTTTG ACATTTATGA TGTTTCAAAC 
TGGGACCGCG GTCAATACTT TGATAAACCA 
ATTACTTTTG ACGAAAATAC CAATAGTTAC 
TACATTATTG AGTATAAAAA CGCCAATGGC 
GGGACAGCGA AAGAACCACA ATCGAATAAT 
AATGAAGCGT TAGACATTTT GAGTGCAACA 
ACAAAAACGA CAGTAACAAC AAAAAATATT 
ACGATTGAAT TAACACCAAA AGGCACAACC 
GTGAAAGGCG TGCCAGAAGA TGCTTATTCA 
ATTTTTAAAG ACTATACATT GACAGAAAAC 
AACGCTGGCC AAATCTATAC AGAAACAACA. 
GCTAGCAAGA AAAAAGTCAC CACTGCGCCA 
GAAGGTATTG TTTATTTAGC AACTGCCACA 
CAAGCAATTG CGAAGGTTTC TTTTGAACTA 
TTTACAACAG ATGAAAAAGG TCAATACTCC 
TTGCGAGTAA CGAATGTACC GCAGGAATAT 
GCCATTAAGC TGGTCAAAGG AGACAACCAA 
CACAGTCGTT TACAAGTCAA AGATTCAACG 
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ATTTATGTCG GCGATTCATG GAAACCAGAA GAGAACTTTG TTTCAGCAAC AGKHAPJU^Ck 
GGTCAAGACG TTCCCTTCGA AAAAATCACT GTTTCAGGTC AAGTTGATAA CANCAAAGCA 
GGCGTTTATC CAATTATTTA CAGTGACGAA GGTAAAGAAG AAACAGCCTA TGTGACGGTC 
AAACCCGACC AATCTAAGTT AGAGGTCAAA GATACAACGA TTTATGTTGG TGATTCGTGG 
AAACCAGAAG ATAATTTCGT TTCAGCGACA GACAAAACAG GTCAAGACGT NCCGTTTGAA 
AAAATTGATG TTCAGGGAAC AGTGAATGTT GATAAAATAG GCGATTATGA AATTGTCTAT 
AAAAATGGCA NAAAAGAAGC GAAAGCAATC GTTCATGTCC GTGATGACAG TCAGTTAGAG 
GTTAAAGATA CAACGATTTA TGTTGGTGAT TCGTGGAAAC CAGAAGATAA TTTCGTTTCA 
GCAACAGACA AAACAGGCCA AGACGTTCCG TTTGAAAAAA TCACTGTTTC AGGTCAAGTT 
GATACTAGCA AAGCAGGCGT TTATCCAATC GTTTACAGTT ACGAAGGTAA AGAAGAAACA 
GCTAATGTGA CTGTCAAACC CGACCAATCT AAGTTAGAGG TTAAAGATAC AACGATTTAT 
GTGGGCGATA AATGGGAACC AGAAGATAAT TTCGTTTCAG CAACAGACAA AACAGGTCAA 
GATGTCCCGT TTGAAAAAAT TGACGTTCAG GGAACAGTGA ATGTTGATAA AATAGGCGAT 
TATGAAATTG TCTATAAAAA TGGCACAAAA GAAGCGAAAG CAATCGTTCA TGTCCGTGAT 
GACAGTCAGT TAGAGGTCAA AGATACAACA ATTTATGTGG GTGATAAATG GGAAGCAGAA 
GATAACTTCG TTTCCGCGAC AGACAAAACA GGTCAAGACG TTCCGTTTGA AAAAATTGAT 
GTTCAGGGAA CAGTGAATGT TGATAAAATA GGCGATTATG AAATTGTCTA TAAAAATGGC 
ACAAAAGAAG CGAAAGCAAT CGTTCATGTC CGTGATGATA GTCGTTTACA AGTCAAGGAT 
ACAACGATTT ATGTCGGCGA TTCNTGGANA CCAGAAGNGA ACTTTGTTTC AGCNACAGAT 
AAAACAGGTC AAGATGTCCC ATTC 

EF104-4 (SEQ ID NO: 404) 

VTTTEAQTE TTDTAKKEAE LSNSTPSLPL 

ATTTTSEMNQ PTATTESQTT EASTTASSDA ATPSEQQTTE DKDTSLNEKA LPDVQAPITD 
ELLDSMSLAP IGGTEYSQTE VHRELNTTPV TATFQFAVGN TGYAPGSVYT VQLPEHLGYS 
TVSGEVTGIG ATWAVDAATK TLSITFNQRV SDTSFKVELK SYLTTEAEPL IKIETPGKNK 
KTYSFDLYEQ VEPIQYNERT RTTGLDGEIF YNLDRTLTGN QTLELLTTET PGAVFGKQDN 
LEPQVFSYDV DINGQILPET QTLLTPGKDY TLSDNSLGRI AVTVPNMNQQ KAYSLSINRT 
lYLESASDYN YLYSQQYPTT KIGSISLKST TGTKQTTDFT AKTSQTSKVI ADREMRSMSY 
ISFQSKGKYY VTIYGTLTET KVGQQIVLES TNGQEIKNPK FTAYGPLYEN VKLEDYFDIK 
TEGGKLTLTA TKDSYLRINI SDLTMDFDKK DINLSLSTPV IGPNKAIQLV SDQYIEPISV 
VNPLNAETAW GNYDQNGAYS SRTTVSVMGS KEKPIQNLEI KVKHPNYLSL RATKEIYFYY 
KLGTDYTVTP TSDGSVIKFT TPITNEIQIP IGFNYVPDSL PKDKSIPVDT IPITMSAEGL 
TPVDTTVTTN SKRGSERTLQ SSKNQFLVNA RNDSFDSLSV RTKIPAGADV LFDIYDVSND 
QVDSIYPQYW DRGQYFDKPM TPNSPGYPTI TFDENTNSYT FDFGKTNKRY IIEYKNANGW 
IDVPTLYITG TAKEPQSNNN EGSASVSVQN EALDILSATQ AANPTLKNVT KTTVTTKNID 
NKTHRVKNPT lELTPKGTTN AQIDLNSITV KGVPEDAYSL EKTTNGAKVI FKDYTLTENI 
TIEYNTVSAN AGQIYTETTI DSETLNQMSA SKKKVTTAPI TLKFSEGDAE GIVYLATATF 
YTHNVEDENQ AIAKVSFELI DNVTHTATEF TTDEKGQYSF DAIMTGDYTL RVTNVPQEYS 
VDEEYLTGKA IKLVKGDNQL KIPLTKTIDH SRLQVKDSTI YVGDSWKPEE NFVSATDKTG 
QDVPFEKITV SGQVDNXKAG VYPIIYSDEG KEETAYVTVK PDQSKLEVKD TTIYVGDSWK 
PEDNFVSATD KTGQDVPFEK IDVQGTVNVD KIGDYEIVYK NGXKEAKAIV HVRDDSQLEV 
KDTTIYVGDS WKPEDNFVSA TDKTGQDVPF EKITVSGQVD TSKAGVYPIV YSYEGKEETA 
NVTVKPDQSK LEVKDTTIYV GDKWEPEDNF VSATDKTGQD VPFEKIDVQG TVNVDKIGDY 
EIVYKNGTKE AKAIVHVRDD SQLEVKDTTI YVGDKWEAED NFVSATDKTG QDVPFEKIDV 
QGTVNVDKIG DYEIVYKNGT KEAKAIVHVR DDSRLQVKDT TIYVGDSWXP EXNFVSATDK 
TGQDVPF 

EF105-1 (SEQ ID NO:405) 



TAAATGAAAA AAACAGTCGT CTACTCCTTG TTATTCGGAA 
GTTCCTGCTG AAGCGGCGAC GGTCGTTTTT GATAGCGAAC 
AGCACAGATG GGACGGATCC AGTAAATCCA GAAAATCCCG 



CAATGTTGCT TGGCGCCACT 
AGTCGATTGT TTTTACCCCA 
ATCCAGAAAA ACCAGTTCGA 
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CCAGTCGATC CAACGAATCC TGATGGACCT AATCCAGGTA CCCCTGGTCC ACTTTCCATC 
GATTATGCCT CAAGTTTGGA TTTTGGGAGT AATGAGATAT CGAATAAGGA TCAAACGTAT 
TTTGCCAGAG CGCAAACCTA TAGAAATCCA GATGGTTCAG CAAGTGAATT GGCAACTGCT 
AATTATGTAC AAGTAAGTGA TTTACGGGGA ACCAATGCTG GCTGGGTTTT AAAAGTGAAA 
CAAAATGGTC AATTTCGTAA TGCAGAAACA TTACACAAAG AATTAACAGG GGCCACCGTC 
GCCTTTACTG AGCCCAGTGT TCGCTCAAAT GCGACGGACG TATTGCCGCC AACTGCTACC 
GCAAACATTC AATTAGATGC TGCGGGCGCA GAAACTGTTG TCATGCAAGC CCCAGAAAAG 
ACCGGCGCCG GAACGTGGAT CACGCTGTGG GGGCAAGCAG AAAAAGTGAC CGAAAAAAAT 
CAACAAGGAC AGCAAGTAAA TGCCAC AATC ACACGGGCAA TCTCACTAAC TGTTCCTGGG 
AAAACCCCTA AGGATGCAGT ACAATATAAA ACAACATTGA CTTGGCTACT TTCAGATGTA 
CCAGTAAATA ATGGAGGGAA ATAA 



EF105-2 (SEQ ID NO:406) 

MKKTWYSLL FGTMLLGATV PAEAATVVFD 
VDPTNPDGPN PGTPGPLSID YASSLDFGSN 
YVQVSDLRGT NAGWVLKVKQ NGQFRNAETL 
NIQLDAAGAE TWMQAPEKT GAGTWITLWG 
TPKDAVQYKT TLTWLLSDVP VNNGGK 

EF105-3 (SEQ ID NO:407) 



SEQSIVFTPS TDGTDPVNPE NPDPEKPVRP 
EISNKDQTYF ARAQTYRNPD GSASELATAN 
HKELTGATVA FTEPSVRSNA TDVLPPTATA 
QAEKVTEKNQ QGQQVNATIT RAISLTVPGK 



GGCGAC GGTCGTTTTT GATAGCGAAC AGTCGATTGT TTTTACCCCA 

AGCACAGATG GGACGGATCC AGTAAATCCA GAAAATCCCG ATCCAGAAAA ACCAGTTC<3A 
CCAGTCGATC CAACGAATCC TGATGGACCT AATCCAGGTA CCCCTGGTCC ACTTTCCATC 
GATTATGCCT CAAGTTTGGA TTTTGGGAGT AATGAGATAT CGAATAAGGA TCAAACGTAT 
TTTGCCAGAG CGCAAACCTA TAGAAATCCA GATGGTTCAG CAAGTGAATT GGCAACTGCT 
AATTATGTAC AAGTAAGTGA TTTACGGGGA ACCAATGCTG GCTGGGTTTT AAAAGTGAAA 
CAAAATGGTC AATTTCGTAA TGCAGAAACA TTACACAAAG AATTAACAGG CGCCACCGTC 
GCCTTTACTG AGCCCAGTGT TCGCTCAAAT GCGACGGACG TATTGCCGCC AACTGCTACC 
GCAAACATTC AATTAGATGC TGCGGGCGCA GAAACTGTTG TCATGCAAGC CCCAGAAAAG 
ACCGGCGCCG GAACGTGGAT CACGCTGTGG GGGCAAGCAG AAAAAGTGAC CGAAAAAAAT 
CAACAAGGAC AGCAAGTAAA TGCCACAATC ACACGGGCAA TCTCACTAAC TGTTCCTGGG 
AAAACCCCTA AGGATGCAGT AC 

EF105-4 (SEQ ID NO:408) 

ATWFD SEQSIVFTPS TDGTDPVNPE NPDPEKPVRP 

VDPTNPDGPN PGTPGPLSID YASSLDFGSN EISNKDQTYF ARAQTYRNPD GSASELATAN 
YVQVSDLRGT NAGWVLKVKQ NGQFRNAETL HKELTGATVA FTEPSVRSNA TDVLPPTATA 
NIQLDAAGAE TWMQAPEKT GAGTWITLWG QAEKVTEKNQ QGQQVNATIT RAISLTVPGK 
TPKDAV 

EF106-1 (SEQ ID NO:409) 

TAGTCGTTTA TGAAGAAAAA AATCGTTGGT ACAATTACGT TGTTGGCTTT AAGTGCGTTA 
TTAGTTGGTG GAGCAGGAGG GGCTTTGACG GCAGAAGCAT ACGTTCCTCA AAGCGTAGAC 
AATCCCAATA ATTTAGGGGA TTTACCTGAG TATTTACGTT CAGTTGGTAT TAGACAAGAT 
GAAGGATTAT CAGAAAAAGA TTGGGCTGGA ACACGCGTTT ATGATCGAAA TGGGAATGAC 
TTAACAGATG AAAATCAAAA CCTATTACAT GCAATCAAAT TTGATGCAAC CACTAGTTTC 
TATGAATTTT TTGATAAAGA GACTGGAGAA TCAACAGGAG ATGAAGGAAC CTTCTTTATG 
ACCGCTGGTA TTACAGATGT TTCCCGTCTT GTAATTATTT CTGAAACCAA AAATTATCAA 
GGTGTATACC CACTTAGAAC TTTATACCAA GATACTTTTA CGTATAGACA GATGGGGAAA 
GATAAAAACG GAAATGATAT TGAAGTTTTC GTAGAAAACA AAGCAACCTC AGGACCAGTT 
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TATGGTCGTC CGCAGCCATA CCCCAATAAT CGTCCCAGAA CACTAGAATT CACGAATGGA 
CGCCGTGCCA TGACAGAACA AACAGGCCAG ATTGATGTAA ATCGACAAGG GGATGAAATT 
ATTGGTAAAA CTTCCTTTGA TGGGACACCG CAACTTCTTT GGAATGGCAC AAAAGTAGTG 
GATAAAGATG GCAATGACGT AACTTCGGCC AACCAAAACT TTATCAGCTT AGCGAAATTT 
GACCAAGATA GCAGCAAATA TGAATTTTTC AATTTACAAA CTGGTGAAAC TCGTGGCGAC 
TATGGCTACT TTAAAGTAGG AAATCAAAAT AAATTCCGTG CCCATGTTTC CATTGGAACC 
AATCGCTATG GCGCTGTCTT AGAGTTAACA GAATTGAATG ATAATCGTTT TACGTACACA 
CGAATGGGTA AAGATAACGA AGGAAACGAT ATCCAAGTCT ATGTGGAACA TGAACCATAC 
CAAGGAACTT TTAATCCTGA ATTTACCTTT TAA 



EF106-2 (SEQ ID NO:410) 



MKKKIVGT ITLLALSALL VGGAGGALTA EAYVPQSVDN PNNIiGDLPEY LRSVGIRQDE 
GLSEKDWAGT RVYDRNGNDL TDENQNLLHA IKFDATTSFY EFFDKETGES TGDEGTFFMT 
AGITDVSRLV IISETKNYQG VYPLRTLYQD TFTYRQMGKD KNGNDIEVFV ENKATSGPVY 
GRPQPYPNNR PRTLEFTNGR RAMTEQTGQI DVNRQGDEII GKTSFDGTPQ LLWNGTKWD 
KDGNDVTSAN QNFISIiAKPD QDSSKYEFFN LQTGETRGDY GYFKVGNQNK FRAHVSIGTN 
RYGAVLELTE LNDNRFTYTR MGKDNEGNDI QVYVEHEPYQ GTFNPEFTF 



EF106-3 (SEQ ID N0:411) 
AT ACGTTCCTCA AAGCGTAGAC 

AATCCCAATA ATTTAGGGGA TTTACCTGAG TATTTACGTT CAGTTGGTAT TAGACAAGAT 
GAAGGATTAT CAGAAAAAGA TTGGGCTGGA ACACGCGTTT ATGATCGAAA TGGGAATGAC 
TTAACAGATG AAAATCAAAA CCTATTACAT GCAATCAAAT TTGATGCAAC CACTAGTTTC 
TATGAATTTT TTGATAAAGA GACTGGAGAA TCAACAGGAG ATGAAGGAAC CTTCTTTATG 
ACCGCTGGTA TTACAGATGT TTCCCGTCTT GTAATTATTT CTGAAACCAA AAATTATCAA 
GGTGTATACC CACTTAGAAC TTTATACCAA GATACTTTTA CGTATAGACA GATGGGGAAA 
GATAAAAACG GAAATGATAT TGAAGTTTTC GTAGAAAACA AAGCAACCTC AGGACCAGTT 
TATGGTCGTC CGCAGCCATA CCCCAATAAT CGTCCCAGAA CACTAGAATT CACGAATGGA 
CGCCGTGCCA TGACAGAACA AACAGGCCAG ATTGATGTAA ATCGACAAGG GGATGAAATT 
ATTGGTAAAA CTTCCTTTGA TGGGACACCG CAACTTCTTT GGAATGGCAC AAAAGTAGTG 
GATAAAGATG GCAATGACGT AACTTCGGCC AACCAAAACT TTATCAGCTT AGCGAAATTT 
GACCAAGATA GCAGCAAATA TGAATTTTTC AATTTACAAA CTGGTGAAAC TCGTGGCGAC 
TATGGCTACT TTAAAGTAGG AAATCAAAAT AAATTCCGTG CCCATGTTTC CATTGGAACC 
AATCGCTATG GCGCTGTCTT AGAGTTAACA GAATTGAATG ATAATCGTTT TACGTACACA 
CGAATGGGTA AAGATAACGA AGGAAACGAT ATCCAAGTCT ATGTGGAACA TGAACCATAC 
CAAGGAACTT 



EF106-4 (SEQ ID NO:412) 
YVPQSVDN PNNLGDLPEY LRSVGIRQDE 

GLSEKDWAGT RVYDRNGNDL TDENQNLLHA IKFDATTSFY EFFDKETGES TGDEGTFFMT 
AGITDVSRLV IISETKNYQG VYPLRTLYQD TFTYRQMGKD KNGNDIEVFV ENKATSGPVY 
GRPQPYPNNR PRTLEFTNGR RAMTEQTGQI DVNRQGDEII GKTSFDGTPQ LLWNGTKWD 
KDGNDVTSAN QNFISLAKFD QDSSKYEFFN LQTGETRGDY GYFKVGNQNK FRAHVSIGTN 
RYGAVLELTE LNDNRFTYTR MGKDNEGNDI QVYVEHEPYQ GT 



EF107-1 (SEQ ID NO: 413) 

TAAAAAACGG CACTCAATAT GTCAAAATTT 
ATANATANAA AAATGCTAGT TATCAGTATC 
CTTTATAGAG ACTATAGATT GAATTTTTAC 
AATTGGAAAA GATGGCTAGT TGTTGGGTTA 



GAAATTTCAA GCTGTGTGTT CTTTGGTAAA 
GATAATAACA GGATACTGAT TAAGAAAGGA 
ATAGAAAGAA GGAGCAAGAT GAAGCGAGTA 
AGTTGTTCTT TGTTCATGGA TTCAGTGGTT 
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GGTGTGACTG TGTTAGCGGA AACGATTACT GGGGCGACGG AGCAAGGAGT AGCAACATCT 
CAGTCGAGTG ACGAAGCGAG CCAGACGACG CAAACAACCG AAGAGTCACA GGCAACGGTC 
GCTAGTGAAG CGAAAACAGT ACCGCCACAG GAAACGGCAA GAATTGCTTC TCGAGCGATT 
GGTTATTCTT CTGTGGAAGG GCGCGAGATT CCCTTTTTCT TTGTGGAGGA AGAC<3GGACG 
TTGTTTGATC CCGACCGAAT TACGATGGCG GTCAATCTTT CCACGTTTTC GTTTTATGAA 
GAGAAATTAC AACGAACCCC CCTTGAGCCC ACCACTGTGA ATGGCGGAAA GTTACTGTCT 
ATTCCAACGT CACCAGCTTT TAAATATGAT ACAAATAACC AGAATCCAAG TAATATTTAT 
GGCGTTTCTG AAGTGTCGTT TACTATTCCT AAGGAGTATC AAAGCCTGGA CATTCGACCA 
AGTACGTTTT ATACAGGAGA CACTACGCAA TATCCAGTGC CAACGGTTTT TGCGAACGTT 
GGGGGCAAAG TGACGAACTA TGTGGGCGCC AATGCGGAGA CGGAATTAGA GTTAACCAAT 
GAAAAAATGC CCAATAAGCT GACGTTTGGT CCTAAAAAGA CGTTTAAATA TACGGTAGCT 
ACGGCACCAG GAGGCGTTAC GTATGCGCTG ACCTATTTTT ATGGAGATGT CGGCGGTCCA 
ACTAGTTCGC ACCAAAGACG AGGAACAGCG GGTCCTGTGT ATTATTATTT AACAAAGCGG 
CGTGTCACGG AAAAATTTGA GAATCCCGCA GGCGGGGCGA TTCCTGCGCC AGAAGGTTAT 
ACGCAGGATA AGAAAACCAT TGTAACAGGG GAGGATTTTA CTTTTACCCA AGAAGGCACC 
TTGCCTGAAC GTTACACAGG CAGTGATGGG AAGACGTATT TATTTAAAGG TTGGTACAAA 
GGGAATGCGA AACCTAGCAC GTTGGAAACC ACCAAAACGC CTAGTTATGC GGTGACCTAT 
GATGACAATG ACGATTTGCA TGTGGTCTAT GAAGAAGCAG TGATGAAAAC CTATACGTTG 
CCAGCGAGAG AAGCTTTGTT CGGCTATGTT GATGAGCAAG GAAACTTGAT TAATCCCGCC 
AAGTTTAAGC TAAGTGCGAC CATGGGTGAA AGTGACGGAG CCACAGGGGA AATGACGACT 
TTTCCCACAA TTGATGGAAT CGATATGCCA GCAAGTCAAT TAAAGAAATT AGCCATCCCG 
CAAAAAGTCT ACACACGCCC AGACGATGGG ACAATCGTAA CTTATGGCCC GCAAGAAGTG 
AGTGTTGAAA TTCCTAAGTA TTACCAGACG ATTTCGATTT CACCAACTAC TGCGTATACA 
GGGGATAAAA CCAAGTATCC AGTACCAAAT GAAGTGCGCC GTGGCATCGA AAACCCCGAC 
AACATTGTTA GTAGTTTAGT GGGAANCNCT GCGTATAACT TGACCCAAAA AAGTGCCACA 
CGCTATACTG CCCGCCGTTC TTACTGGANG TGGGGCCCCA CGAAGACACT TTACTCAATG 
AGTATCTATT CAGGAACTGC TGGGGGCAAC TATAATTTAT CGACCCCTGA TGGCACCATT 
TATTATTACT TAGAAAATCG GCGGGTCACT GAACATTTTG TAGACGAAAG TGGCGCAAAA 
ATCACGCCAC CAACTGGCTT TACACAAGGA AATCAGCTAG TGGTGGACAG TGAAAACTAT 
GTCTACACTG TCGCAAAAGC TTTGCCGAAG ATCTACCAAG CTGGTGAAAA AACCTATATC 
TTCCAAGGCT GGTTTAAAGG CAAAACCAAG CCAGCAACAT TAAAGACGAC AACGACCCCA 
AGTTTTACAC CAACTTTTAA TGATGAGGAC GACATGACCG CTGTGTACCA AGAAGCGATT 
CCCACCGCGG AACTAACGTT AACAGGTGCC GTTGACATAA TCGAAAATGG CGCCACAATG 
GATTACTGGG AGGCGCTACT GAAGAACACA GGCGAAGCGC CGTTAACCAC CATTAAAATC 
AAGCCAACGG CAACTTGGGC GGCTGGCATC GGCGCACCCA ACACGATATT TGTACAAGGA 
ACGGGTCAAA ACACCAAAGC TTTTCCTGTC ACCAAAGAAC AATGGACGAC CGGTGCAGGA 
GTGTCCATCA CGTTGGATCA GCCTTTACCA GCTGGCGGTC AATTAAAAAT GAACTTATTA 
GGAACCGCCG TTACAGGAAA TCCTGGTCAA GTTTTAACCG CTGATGTTGA AGTAACGGGC 
AACTTTGGCA GTTTAACTGC CAAAGATACG GTCCGTATTA AAGACTTAGA TCAAGAAATT 
ACGAGTCCTG ACGGCGACGG CTTTATTAGT ACCCCGACAT TTGATTTTGG TAAACTAGCA 
ATTTCAGGAA GTAAGCAACA ATATGGTTTG AAGAAGGCCG CAGATTACTA CGGCAATGGC 
ACTCGCAACC CTTATTTACG CCTGAATACT AGCCAAGCCA ATTGGAGTTT AACGGCCCAG 
CTATCGCAAC CAAAATCAGC CACAGACAGC TTGCCAACAA CGACCCGCTT GTTGCTAGGA 
ACGGCCGCTG CTGCCAGCTT TACCGATTAC AACCAACCAA CAGAAACCAG GACACCACTT 
GGCAAGACCA GCACCGTGAC TTTAACCGCC GACAATACCG CAACAGCGGT GGTCGCAAAC 
CAACAGTTCA CAGGCAGTGA CGTCTATCAG TTGGACTTCA CGTTTGCTAA CATCAAACTA 
GAAGTGCCAG CCAACCAAGG TATGGCTGGC CAACAATACC AAGCCGCCGT CACGTGGAAT 
TTAGTGACTG GCCCCTAA 



EF107-2 (SEQ ID NO: 414) 
MKRVN 

WKRWLWGLS CSLFMDSWG VTVLAETITG 
SEAKTVPPQE TARIASRAIG YSSVEGREIP 



ATEQGVATSQ SSDEASQTTQ TTEESQATVA 
FFFVEEDGTL FDPDRITMAV NLSTFSFYEE 
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KLQRTPLEPT TVNGGKLLSI PTSPAFKYDT NNQNPSNIYG VSEVSFTIPK EYQSLDIRPS 
TFYTGDTTQY PVPTVFANVG GKVTNYVGAN AETELELTNE KMPNKLTFGP KKTFKYTVAT 
APGGVTYALT YFYGDVGGPT SSHQRRGTAG PVYYYLTKRR VTEKFENPAG GAIPAPEGYT 
QDKKTIVTGE DFTFTQEGTL PERYTGSDGK TYLFKGWYKG NAKPSTLETT KTPSYAVTYD 
DNDDLHWYE EAVMKTYTLP AREALFGYVD EQGNLINPAK FKLSATMGES DGATGEMTTF 
PTIDGIDMPA SQLKKLAIPQ KVYTRPDDGT IVTYGPQEVS VEIPKYYQTI SISPTTAYTG 
DKTKYPVPNE VRRGIENPDN IVSSLVGXXA YNLTQKSATR YTARRSYWXW GPTKTLYSMS 
lYSGTAGGNY NLSTPDGTIY YYLENRRVTE HFVDESGAKI TPPTGFTQGN QLWDSENYV 
YTVAKALPKI YQAGEKTYIF QGWFKGKTKP ATLKTTTTPS FTPTFNDEDD MTAVYQEAIP 
TAELTLTGAV DIIENGATMD YWEALLKNTG EAPLTTIKIK PTATWAAGIG APNTIFVQGT 
GQNTKAFPVT KEQWTTGAGV SITLDQPLPA GGQLKMNLLG TAVTGNPGQV LTADVEVTGN 
FGSLTAKDTV RIKDLDQEIT SPDGDGFIST PTFDFGKLAI SGSKQQYGLK KAADYYGNGT 
RNPYLRLNTS QANWSLTAQL SQPKSATDSL PTTTRLLLGT AAAASFTDYN QPTETRTPLG 
KTSTVTLTAD NTATAWANQ QFTGSDVYQL DFTFANIKLE VPANQGMAGQ QYQAAVTWNL 
VTGP 

EF107-3 (SEQ ID NO:415) 
GG AGCAAGGAGT AGCAACATCT 

CAGTCGAGTG ACGAAGCGAG CCAGACGACG CAAACAACCG AAGAGTCACA GGCAACGGTC 
GCTAGTGAAG CGAAAACAGT ACCGCCACAG GAAACGGCAA GAATTGCTTC TCGAGCGATT 
GGTTATTCTT CTGTGGAAGG GCGCGAGATT CCCTTTTTCT TTGTGGAGGA AGACGGGACG 
TTGTTTGATC CCGACCGAAT TACGATGGCG GTCAATCTTT CCACGTTTTC GTTTTATGAA 
GAGAAATTAC AACGAACCCC CCTTGAGCCC ACCACTGTGA ATGGCGGAAA GTTACTGTCT 
ATTCCAACGT CACCAGCTTT TAAATATGAT ACAAATAACC AGAATCCAAG TAATATTTAT 
GGCGTTTCTG AAGTGTCGTT TACTATTCCT AAGGAGTATC AAAGCCTGGA CATTCGACCA 
AGTACGTTTT ATACAGGAGA CACTACGCAA TATCCAGTGC CAACGGTTTT TGCGAACGTT 
GGGGGCAAAG IXSACGAACTA TGTGGGCGCC AATGCGGAGA CGGAATTAGA GTTAACCAAT 
GAAAAAATGC CCAATAAGCT GACGTTTGGT CCTAAAAAGA CGTTTAAATA TACGGTAGCT 
ACGGCACCAG GAGGCGTTAC GTATGCGCTG ACCTATTTTT ATGGAGATGT CGGCGGTCCA 
ACTAGTTCGC ACCAAAGACG AGGAACAGCG GGTCCTGTGT ATTATTATTT AACAAAGCGG 
CGTGTCACGG AAAAATTTGA GAATCCCGCA GGCGGGGCGA TTCCTGCGCC AGAAGGTTAT 
ACGCAGGATA AGAAAACCAT TGTAACAGGG GAGGATTTTA CTTTTACCCA AGAAGGCACC 
TTGCCTGAAC GTTACACAGG CAGTGATGGG AAGACGTATT TATTTAAAGG TTGGTACAAA 
GGGAATGCGA AACCTAGCAC GTTGGAAACC ACCAAAACGC CTAGTTATGC GGTGACCTAT 
GATGACAATG ACGATTTGCA TGTGGTCTAT GAAGAAGCAG TGATGAAAAC CTATACGTTG 
CCAGCGAGAG AAGCTTTGTT CGGCTATGTT GATGAGCAAG GAAACTTGAT TAATCCCGCC 
AAGTTTAAGC TAAGTGCGAC CATGGGTGAA AGTGACGGAG CCACAGGGGA AATGACGACT 
TTTCCCACAA TTGATGGAAT CGATATGCCA GCAAGTCAAT TAAAGAAATT AGCCATCCCG 
CAAAAAGTCT ACACACGCCC AGACGATGGG ACAATCGTAA CTTATGGCCC GCAAGAAGTG 
AGTGTTGAAA TTCCTAAGTA TTACCAGACG ATTTCGATTT CACCAACTAC TGCGTATACA 
GGGGATAAAA CCAAGTATCC AGTACCAAAT GAAGTGCGCC GTGGCATCGA AAACCCCGAC 
AACATTGTTA GTAGTTTAGT GGGAANCNCT GCGTATAACT TGACCCAAAA AAGTGCCACA 
CGCTATACTG CCCGCCGTTC TTACTGGANG TGGGGCCCCA CGAAGACACT TTACTCAATG 
AGTATCTATT CAGGAACTGC TGGGGGCAAC TATAATTTAT CGACCCCTGA TGGCACCATT 
TATTATTACT TAGAAAATCG GCGGGTCACT GAACATTTTG TAGACGAAAG TGGCGCAAAA 
ATCACGCCAC CAACTGGCTT TACACAAGGA AATCAGCTAG TGGTGGACAG TGAAAACTAT 
GTCTACACTG TCGCAAAAGC TTTGCCGAAG ATCTACCAAG CTGGTGAAAA AACCTATATC 
TTCCAAGGCT GGTTTAAAGG CAAAACCAAG CCAGCAACAT TAAAGACGAC AACGACCCCA 
AGTTTTACAC CAACTTTTAA TGATGAGGAC GACATGACCG CTGTGTACCA AGAAGCGATT 
CCCACCGCGG AACTAACGTT AACAGGTGCC GTTGACATAA TCGAAAATGG CGCCACAATG 
GATTACTGGG AGGCGCTACT GAAGAACACA GGCGAAGCGC CGTTAACCAC CATTAAAATC 
AAGCCAACGG CAACTTGGGC GGCTGGCATC GGCGCACCCA ACACGATATT TGTACAAGGA 
ACGGGTCAAA ACACCAAAGC TTTTCCTGTC ACCAAAGAAC AATGGACGAC CGGTGCAGGA 
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GTGTCCATCA CGTTGGATCA GCCTTTACCA GCUGQCGGTC AATTAAAAAT GAACTTATTA 
GGAACCGCCG TTACAGGAAA TCCTGGTCAA GTTTTAACCG CTGATGTTGA AGTAACGGGC 
AACTTTGGCA GTTTAACTGC CAAAGATACG GTCCGTATTA AAGACTTAGA TCAAGAAATT 
ACGAGTCCTG ACGGCGACGG CTTTATTAGT ACCCCGACAT TTGATTTTGG TAAACTAGCA 
ATTTCAGGAA GTAAGCAACA ATATGGTTTG AAGAAGGCCG CAGATTACTA CGGCAATGGC 
ACTCGCAACC CTTATTTACG CCTGAATACT AGCCAAGCCA ATTGGAGTTT AACGGCCCAG 
CTATCGCAAC CAAAATCAGC CACAGACAGC TTGCCAACAA CGACCCGCTT GTTGCTAGGA 
ACGGCCGCTG CTGCCAGCTT TACCGATTAC AACCAACCAA CAGAAACCAG GACACCACTT 
GGCAAGACCA GCACCGTGAC TTTAACCGCC GACAATACCG CAACAGCGGT GGTCGCAAAC 
CAACAGTTCA CAGGCAGTGA CGTCTATCAG TTGGACTTCA CGTTTGGTAA CATCAAACTA 
GAAGTGCCAG CCAACCAAGG TATGGCTGGC CAACAATACC AAGCCGCCGT CACGTGGAAT 
TTAGTGACTG GCCCCT 

EF107-4 {SEQ ID NO:416) 

EQGVATSQ SSDEASQTTQ TTEESQATVA 

SEAKTVPPQE TARIASRAIG YSSVEGREIP FFFVEEDGTL FDPDRITMAV NLSTFSFYEE 
KLQRTPLEPT TVNGGKLLSI PTSPAFKYDT NNQNPSNIYG VSEVSFTIPK EYQSLDIRPS 
TFYTGDTTQY PVPTVFANVG GKVTNYVGAN AETELELTNE KMPNKLTFGP KKTFKYTVAT 
APGGVTYALT YFYGDVGGPT SSHQRRGTAG PVYYYLTKRR VTEKFENPAG GAIPAPEGYT 
QDKKTIVTGE DFTFTQEGTL PERYTGSDGK TYLFKGWYKG NAKPSTLETT KTPSYAVTYD 
DNDDLHWYE EAVMKTYTLP AREALFGYVD EQGNLINPAK FKLSATMGES DGATGEMTTF 
PTIDGIDMPA SQLKKLAIPQ KVYTRPDDGT IVTYGPQEVS VEIPKYYQTI SISPTTAYTG 
DKTKYPVPNE VRRGIENPDN IVSSLVGXXA YNLTQKSATR YTARRSYWXW GPTKTLYSMS 
lYSGTAGGNY NLSTPDGTIY YYLENRRVTE HFVDESGAKI TPPTGFTQGN QLWDSENYV 
YTVAKALPKI YQAGEKTYIF QGWFKGKTKP ATLKTTTTPS FTPTFNDEDD MTAVYQEAIP 
TAELTLTGAV DIIENGATMD YWEALLKNTG EAPLTTIKIK PTATWAAGIG APNTIFVQGT 
GQNTKAFPVT KEQWTTGAGV SITLDQPLPA GGQLKMNLLG TAVTGNPGQV LTADVEVTGN 
FGSLTAKDTV RIKDLDQEIT SPDGDGFIST PTFDFGKLAI SGSKQQYGLK KAADYYGNGT 
RNPYLRLNTS QANWSLTAQL SQPKSATDSL PTTTRLLLGT AAAASFTDYN QPTETRTPLG 
KTSTVTLTAD NTATAWANQ QFTGSDVYQL DFTFANIKLE VPANQGMAGQ QYQAAVTWNL 
VTGP 

EF108-1 (SEQ ID NO:417) 

TAATCGGTTT GGCGGGAATC GTACATAGAA AGAAGGGACG . ACATGAAGCA AACTAAGTGG 
CAACGATTAG CAACCATTGG CTTGTGTAGT TCTTTAGTAA TTAACGCCTT TTCTGGTGTG 
ACGGCAGTTG CGGAAACCGT GACGATTGAA AGTAGTCCGA CCGCCGAAAG TAGTGCCAAG 
GAAGAGACGC AAGCAAGTAG CGTGAAGGAA GAAACAACGA AAGCCAGTAC GGAAAATAGT 
CAAGTAACAA CTGACACGAG TCAGGAAGAA GCAACGAAAG AAGCGGAGAA AGAAGAACCG 
CAAGCAGAAG TGGAACAAGC AGAAACACCA ATCATTCCTA AACCAAAAAA AATCAATATG 
AAGGCAACTT ATTCATTTTC TGCAGAAACT TATCAGTTTG GATTTGTGAA TGAATCAGGT 
CAATTAATAA ATCCAGATAT TATACCAATT ACGTATAGCT ATGCCAAAGG ATCATGGAAG 
ACAGATGGTT ATAATCGAAA GTGGACTAGT ATGGTTCAAG GGAGTGCTTC AACCGTAGGA 
AACTTAAAGA ATGTAATAAT GCCAGCAACT TCTGTAGTTA TGCCACCAGG ACCGTCATAT 
GAAGGAACTC AAGAGGTGTA CACAAACTTT TCAATTCGCA TACCAAAATA TTATGCATCA 
GCGAGTCTCT ACAATAGAGA AGGTAAAATT GATTCTACTT ATCCGTTACC TGCTATTGCA 
CTAGCAGGTA CTAGACCGCT ATCTTTGACT CAAAGTAGTG TAATTAGTGC ATTGGCGCTG 
ACCAGTAAAG GAGACAATGT TTATACACCA CGGGAAACAT TTTTTGGAGG AGATCCTGCA 
GGTGTAAAGT TTACTAATTT TTTGTATCGT ATAAATGACT TTGATGTGAA AGGTAATAAC 
ATAGGTTATA AGACTGTGAG TAGCCCAATC TATTACCATC TGACCAACCG CCGTGTCACC 
GAAAACTTCG TAGATACAAG TGGCGCCAAA ATCACGCCAC CAAGTAATTT CACCCAAGGG 
AAACAAACGG TCATTAACAG TGATCCTTAC ACGTTCCAAC AAAGTGGTTT TTTACCCGAG 
ACCTACAAAG TTGGCACGAA ATCTTACCGA TTCAAAGGCT GGTACAAACG GAAAACCAAA 
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ACCGAGCCTT TGGCCACCAC TAAAACACCT AGCTATAAAG TCACGTATGA TGACAATGAT 
GATTTGACGG TGGTCTATGA GGAGTTTTCA GGGTACGAGC TGCCTGCTTC GACCAATCAA 
TTTGGCTTTG TGGATGAAGC GACGAACAAA TTAATTGCCC CCGACCAAGT GCAGATGAAG 
TATAATCTTA CTTTAAATGA AAATAATAAA AAAACAGTAA TGAGCAGTAA CTTAACGGGG 
ACAGATACAG CGACACTGAA AAACTTGTCC GTGCCTGTCA ACTATTTTGA ACAATATCGC 
GTCAATACGT TTTATGGCGC GAGTGACATT ACGTTTACAT TGCCCAAACG GTACAAATCA 
ATCAATATTA CCAAATCAGA TGGCAAAACC GACCCAGCTT TTCCTCTTCC TAAAATCTAT 
AATATAGATC AAGTAGAAAT GTCACACATG CCTGTGACCA CTTATAACAA GTTGAAACAG 
CTGTCGGGCC AAACGTTTGG CTTTAATGCT TTAGCCGATC AACCTGAATT TTATACGAAA 
ACGTTATTTG GGACAGAGTC TGGCATCGAT GACCCAGTCA ATTATTATAC AATGAGTGGC 
CCTGTTTACT ATTATTTAGA AAACCGCAAA GTCACCGAGA ACTTCGTAGA CACCAACGGC 
GCTAAAATCA CACCGCCAAC AGGTTTCACC CAAGGTAAAA - AAACGGTGAT TACAAGGGAC 
GCCTACACTT TCAAACAAGC AGGCACCTTA CCAGACACTT ACACAACAGG CGGTAAGACC 
TACAAGTTCA AAGGTTGGTA CAAAGGCAAG TCCATACTCA ACACATTGAC AACTACCAAA 
GCGCCAAGTT ATCAAGTGAC CTACGATGAC AATGATGATT TGAATGTGGT GTATGAAGAA 
GAAACAGTTA CGACAGTGTA TCCATCAGTC GATATGAACT TTGTGAATGA AAAAGGCGGG 
GCTTTCACAC CGGCGTTAAC TTTTAGTGGT AAGTACTATG CGCAAAGTAC GAGTGCGTAC 
TTAAGAACCG ATTTATATGA CGTGACCTCA AAAAATAATG GTAATGGGCA ATATACGGTA 
AGTATTAATA ATGGTAGTAT GCCATTGTCC CAAGAATTAT TGAAAAAATA TAATAATGGA 
CAACCAATCA GTGCTACCAA CAGATTACAG TTTAATGTTG ATAAATTAGC CATCGACCAA 
CAACTAAAAT ATGTTGACAG CATTCAATTA GACACAGCTC AAAGTAGCAA TCTGAAATCC 
TATAGATATG TGTACACGAA CAATAGCTCA CTGGTTTTCG ACCCAAATGT AGCACCAGCA 
GAGGTTGACC TTAGTTCAGA ATCTCTTAAC TTGCTTAATT TTGATTCAGA TGGCACCTAT 
TTTTCTAATG CAAATAATAG ACTTTTTTAC ACGCATTTAG GATATAGTGG CACACCAGGA 
GTTAACTATC TTCTCGTAAT GTTTCTTTTT AACGCCAAAC CTGCGGATAA GTCAAAACTT 
GTCTACAAAG TCACTCGCAA ACAAGTCACC GAAAACTTCG -TGGATGTCAA CGGTGCCAAA 
ATCACTGCAC CAACAGGCTT CACCCAAGGT AACCAAGTAC CAATGAACAG TAACACCTTC 
AAGTACACAG CGGCAA7VAGC TTTACCAGCG ACGTATACTA CAGGTGGCAA AGTCTATACG 
TTCCAAGGGT GGTATAAAGG GAAAACCAAG CCAAGTACGT TGAACAAAAC AACAACTCCA 
ACGTTCAATG CGACCTTTGA TGGCAATGAC GATATGACCG CCATGTATAA GGAAGAAATA 
CCAACAGCTA GTGTCACATT AACTCGACCA AAAGAAGTGA TTGATACGAA TACCAATGTA 
ATCTGGACAA CAACGATCAC GAATACTAGC AAAGCACCCT TACAAAATCT CACCTTGAAA 
AAAGGGCCCA ATTGGTCAGC TGGTCTGACG ATCCCGACCT TTATGGAAGT GACACCAGAA 
GGAGAAACGA CAAAATCAAT CCCAGTAAAT AGTACACTTT GGACAGAGGG GGTTCCTTTA 
CCAAATGCCG TTCCTATCGG CAAAAAAGTT TCAGTTGCTT TCACAACTCG CGCAACAGGG 
AAACCAAACA CTGTTTTGAA AGCAGAAGTT GTAGTATTTG GTGGTATTAA AGATAGTACA 
GTGGATAACT TCGTGAGAAT TCGTCCAAAT GATCAAGAAG TAGTCACACC AACGACCGAA 
GGCTTCATCA GTGTGCCAAC CTTCGACTTC GGCCAAGTGG GCGTTGCAGG AACTAAGCAA 
CAACACAGCT TGAAACAAGC CGCGGATTAC TACGGTAACG GCACACGGAA TCCGTATCTG 
CGGATTAAGA AAACGCAACC CAATTGGAGC TTAACAGCGC AACTGTCACA ACCAAAATCA 
GCGACAGACA GCTTGCCTAC AGCGACCCGC TTATTATTAG GGGCGGCGCC TGTCTCTAGC 
TTTACCAATT ACAATCAACC AACCGAGTTG AAAAATACGG TCGGTACCAC GAGTGCCATT 
AGCTTAACAG CCAACAACAC AGCAACGAGT ATTATTGCCA ACAAGCAATT CACAGGTAGT 
AATGTTTATC AGTTGGACTT CACCTTCAAT AATGTCAAAC TTGAAGTGCC AGCCAATCAA 
GGTGTTAAAG GGCAACAATA CAAGGCCGCA GTTACATGGA ACCTAGTTAC AGGTCCTTAA 

EF108-2 (SEQ ID NO:418) 

MKQTKWQ RLATIGLCSS LVINAFSGVT AVAETVTIES SPTAESSAKE 
ETQASSVKEE TTKASTENSQ VTTDTSQEEA TKEAEKEEPQ AEVEQAETPI IPKPKKINMK 
ATYSFSAETY QFGFVNESGQ LINPDIIPIT YSYAKGSWKT DGYNRKWTSM VQGSASTVGN 
LKNVIMPATS WMPPGPSYE GTQEVYTNFS IRIPKYYASA SLYNREGKID STYPLPAIAL 
AGTRPLSLTQ SSVISAIiALT SKGDNVYTPR ETFFGGDPAG VKFTNFLYRI NDFDVKGNNI 
GYKTVSSPIY YHLTNRRVTE NFVDTSGAKI TPPSNFTQGK QTVINSDPYT FQQSGFLPET 
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YKVGTKSYRF KGWYKGKTKT EPLATTKTPS YKVTYDDNDD LTWYEEFSG YELPASTNQF 
GFVDEATNKL lAPDQVQMKY NLTLNENNKK TVMSSNLTGT DTATLKNLSV PVNYFEQYRV 
ntfygasd.it FTLPKRYKSI NITKSDGKTD PAFPLPKIYN IDQVEMSHMP VTTYNKLKQL 
SGQTFGFNAL ADQPEFYTKT LFGTESGIDD PVNYYTMSGP VYYYLENRKV TENFVDTNGA 
KITPPTGFTQ GKKTVITSDA YTFKQAGTLP DTYTTGGKTY KFKGWYKGKS ILNTLTTTKA 
PSYQVTYDDN DDLNWYEEE TVTTVYPSVD MNFVNEKGGA FTPALTFSGK YYAQSTSAYL 
RTDLYDVTSK NNGNGQYTVS INNGSMPLSQ ELLKKYNNGQ PISATNRLQF nvdklaidqq 
LKYVDSIQLD TAQSSNLKSY RYVYTNNSSL VFDPNVAPAE VDLSSESLNL LNFDSDGTYF 
SNANNRLFYT HLGYSGTPGV NYLLVMFLFN AKPADKSKLV YKVTRKQVTE NFVDVNGAKI 
TAPTGFTQGN QVPMNSNTFK YTAAKALPAT YTTGGKVYTF QGWYKGKTKP STLNKTTTPT 
FNATFDGNDD MTAMYKEEIP TASVTLTRPK EVIDT^P^NVI WTTTITNTSK APLQNLTLKK 
GPNWSAGLTI PTFMEVTPEG ETTKSIPVNS TLWTEGVPLP NAVPIGKKVS VAFTTRATGK 
POTVLKAEW VFGGIKDSTV DNFVRIRPND QEWTPTTEG FISVPTFDFG QVGVAGTKQQ 
HSLKQAADYY GNGTRNPYLR IKKTQPNWSL TAQLSQPKSA TDSLPTATRL LLGAAPVSSF 
TNYNQPTELK NTVGTTSAIS LTANNTATSI IANKQFTGSN VYQLDFTFNN vklevpanqg 
VKGQQYKAAV TWNLVTGP 

EF108-3 (SEQ id N0:419) 

CGT GACGATTGAA AGTAGTCCGA CCGCCGAAAG TAGTGCCAAG 

GAAGAGACGC AAGCAAGTAG CGTGAAGGAA GAAACAACGA AAGCCAGTAC GGAAAATAGT 
CAAGTAACAA CTGACACGAG TCAGGAAGAA GCAACGAAAG AAGCGGAGAA AGAAGAACCG 
CAAGCAGAAG TGGAACAAGC AGAAACACCA ATCATTCCTA AACCAAAAAA AATCAATATG 
AAGGCAACTT ATTCATTTTC TGCAGAAACT TATCAGTTTG GATTTGTGAA TGAATCAGGT 
CAATTAATAA ATCCAGATAT TATACCAATT ACGTATAGCT ATGCCAAAGG ATCATGGAAG 
ACAGATGGTT ATAATCGAAA GTGGACTAGT ATGGTTCAAG GGAGTGCTTC AACCGTAGGA 
AACTTAAAGA ATGTAATAAT GCCAGCAACT TCTGTAGTTA TGCCACCAGG AGCGTCATAT 
GAAGGAACTC AAGAGGTGTA CACAAACTTT TCAATTCGCA TACCAAAATA TTATGCATCA 
GCGAGTCTCT ACAATAGAGA AGGTAAAATT GATTCTACTT ATCCGTTACC TGCTATTGCA 
CTAGCAGGTA CTAGACCGCT ATCTTTGACT CAAAGTAGTG TAATTAGTGC ATTGGCGCTG 
ACCAGTAAAG GAGACAATGT TTATACACCA CGGGAAACAT TTTTTGGAGG AGATCCTGCA 
GGTGTAAAGT TTACTAATTT TTTGTATCGT ATAAATGACT TTGATGTGAA AGGTAATAAC 
ATAGGTTATA AGACTGTGAG TAGCCCAATC TATTACCATC TGACCAACCG CCGTGTCACC 
GAAAACTTCG TAGATACAAG TGGCGCCAAA ATCACGCCAC CAAGTAATTT CACCCAAGGG 
AAACAAACGG TCATTAACAG TGATCCTTAC ACGTTCCAAC AAAGTGGTTT TTTACCCGAG 
ACCTACAAAG TTGGCACGAA ATCTTACCGA TTCAAAGGCT GGTACAAAGG gaaaaccaaa 
accgagcctt tggccaccac TAAAACACCT AGCTATAAAG TCACGTATGA TGACAATGAT 
GATTTGACGG TGGTCTATGA GGAGTTTTCA GGGTACGAGC TGCCTGCTTC GACCAATCAA 
TTTGGCTTTG TGGATGAAGC GACGAACAAA TTAATTGCCC CCGACCAAGT GCAGATGAAG 
TATAATCTTA CTTTAAATGA AAATAATAAA AAAACAGTAA TGAGCAGTAA CTTAACGGGG 
ACAGATACAG CGACACTGAA AAACTTGTCC GTGCCTGTCA ACTATTTTGA ACAATATCGC 
GTCAATACGT TTTATGGCGC GAGTGACATT ACGTTTACAT TGCCCAAACG GTACAAATCA 
ATCAATATTA CCAAATCAGA TGGCAAAACC GACCCAGCTT TTCCTCTTCC TAAAATCTAT 
AATATAGATC AAGTAGAAAT GTCACACATG CCTGTGACCA CTTATAACAA gttgaaacag 

ctgtcgggcc aaacgtttgg ctttaatgct ttagccgatc aacctgaatt ttatacgaaa 
acgttatttg ggacagagtc tggcatcgat gacccagtca attattatac aatgagtggc 
cctgtttact attatttaga aaaccgcaaa gtcaccgaga acttcgtaga caccaac-ggc 

GCTAAAATCA CACCGCCAAC AGGTTTCACC CAAGGTAAAA AAACGGTGAT TACAAGCGAC 
GCCTACACTT TCAAACAAGC AGGCACCTTA CCAGACACTT ACACAACAGG CGGTAAGAGC 
TACAAGTTCA AAGGTTGGTA CAAAGGCAAG TCCATACTCA ACACATTGAC AACTACCAAA 
GCGCCAAGTT ATCAAGTGAC CTACGATGAC AATGATGATT TGAATGTGGT gtatgaagaa 

gaaacagtta cgacagtgta tccatcagtc gatatgaact ttgtgaatca aaaaggcggg 
gctttcacac cggcgttaac ttttagtggt aagtactatg cgcaaagtac hGAGTGGGTAC 

TTAAGAACCG ATTTATATGA CGTGACCTCA AAAAATAATG GTAATGGGCA ATATACGGTA 
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AGTATTAATA ATGGTAGTAT GCCATTGTCC CAAGAATTAT TGAAAAAATA TAATAATGGA 
CAACCAATCA GTGCTACCAA CAGATTACAG TTTAATGTTG ATAAATTAGC CATGGACCAA 
CAACTAAAAT ATGTTGACAG CATTCAATTA GACACAGCTC AAAGTAGCAA TCTGAAATCC 
TATAGATATG TGTACACGAA CAATAGCTCA CTGGTTTTCG ACCCAAATGT AGCACCAGCA 
GAGGTTGACC TTAGTTCAGA ATCTCTTAAC TTGCTTAATT TTGATTCAGA TGGCACCTAT 
TTTTCTAATG CAAATAATAG ACTTTTTTAC ACGCATTTAG GATATAGTGG CACACCAGGA 
GTTAACTATC TTCTCGTAAT GTTTCTTTTT AACGCCAAAC CTGCGGATAA GTCAAAACTT 
GTCTACAAAG TCACTCGCAA ACAAGTCACC GAAAACTTCG TGGATGTCAA CGGTGCCAAA 
ATCACTGCAC CAACAGGCTT CACCCAAGGT AACCAAGTAC CAATGAACAG TAACACCTTC 
AAGTACACAG CGGCAAAAGC TTTACCAGCG ACGTATACTA CAGGTGGCAA AGTCTATACG 
TTCCAAGGGT GGTATAAAGG GAAAACCAAG CCAAGTACGT TGAACAAAAC AACAACTCCA 
ACGTTCAATG CGACCTTTGA TGGCAATGAC GATATGACCG CCATGTATAA GGAAGAAATA 
CCAACAGCTA GTGTCACATT AACTCGACCA AAAGAAGTGA TTGATACGAA TACCAATGTA 
ATCTGGACAA CAACGATCAC GAATACTAGC AAAGCACCCT TACAAAATCT CACCTTGAAA 
AAAGGGCCCA ATTGGTCAGC TGGTCTGACG ATCCCGACCT TTATGGAAGT GACACCAGAA 
GGAGAAACGA CAAAATCAAT CCCAGTAAAT AGTACACTTT GGACAGAGGG GGTTCCTTTA 
CCAAATGCCG TTCCTATCGG CAAAAAAGTT TCAGTTGCTT TCACAACTCG CGCAACAGGG 
AAACCAAACA CTGTTTTGAA AGCAGAAGTT GTAGTATTTG GTGGTATTAA AGATAGTACA 
GTGGATAACT TCGTGAGAAT TCGTCCAAAT GATCAAGAAG TAGTCACACC AACGACCGAA 
GGCTTCATCA GTGTGCCAAC CTTCGACTTC GGCCAAGTGG GCGTTGCAGG AACTAAGCAA 
CAACACAGCT TGAAACAAGC CGCGGATTAC TACGGTAACG GCACACGGAA TCCGTATCTG 
CGGATTAAGA AAACGCAACC CAATTGGAGC TTAACAGCGC AACTGTCACA ACCAAAATCA 
GCGACAGACA GCTTGCCTAC AGCGACCCGC TTATTATTAG GGGCGGCGCC TGTCTCTAGC 
TTTACCAATT ACAATCAACC AACCGAGTTG AAAAATACGG TCGGTACCAC GAGTGCCATT 
AGCTTAACAG CCAACAACAC AGCAACGAGT ATTATTGCCA ACAAGCAATT CACAGGTAGT 
AATGTTTATC AGTTGGACTT CACCTTCAAT AATGTCAAAC TTGAAGTGCC AGCCAATCAA 
GGTGTTAAAG GGCAACAATA CAAGGCCGCA GTTACATGGA ACCTAGTTAC AG 

EF108-4 (SEQ ID NO:420) 

VTIES SPTAESSAKE 

ETQASSVKEE TTKASTENSQ VTTDTSQEEA TKEAEKEEPQ AEVEQAETPI IPKPKKINMK 
ATYSFSAETY QFGFVNESGQ LINPDIIPIT YSYAKGSWKT DGYNRKWTSM VQGSASTVGN 
LKNVIMPATS WMPPGPSYE GTQEVYTNFS IRIPKYYASA SLYNREGKID STYPLPAIAL 
AGTRPLSLTQ SSVISALALT SKGDNVYTPR ETFFGGDPAG VKFTNFLYRI NDFDVKGNNI 
GYKTVSSPIY YHLTNRRVTE NFVDTSGAKI TPPSNFTQGK QTVINSDPYT FQQSGFLPET 
YKVGTKSYRF KGWYKGKTKT EPLATTKTPS YKVTYDDNDD LTWYEEFSG YELPASTNQF 
GFVDEATNKL lAPDQVQMKY NLTLNENNKK TVMSSNLTGT DTATLKNLSV PVNYFEQYRV 
NTFYGASDIT FTLPKRYKSI NITKSDGKTD PAFPLPKIYN IDQVEMSHMP VTTYNKLKQL 
SGQTFGFNAL ADQPEFYTKT LFGTESGIDD PVNYYTMSGP VYYYLENRKV TENFVDTNGA 
KITPPTGFTQ GKKTVITSDA YTFKQAGTLP DTYTTGGKTY KFKGWYKGKS ILNTLTTTKA 
PSYQVTYDDN DDLNWYEEE TVTTVYPSVD MNFVNEKGGA FTPALTFSGK YYAQSTSAYL 
RTDLYDVTSK NNGNGQYTVS INNGSMPLSQ ELLKKYNNGQ PISATNRLQF NVDKLAIDQQ 
LKYVDSIQLD TAQSSNLKSY RYVYTNNSSL VFDPNVAPAE VDLSSESLNL LNFDSDGTYF 
SNANNRLFYT HLGYSGTPGV NYLLVMFLFN AKPADKSKLV YKVTRKQVTE NFVDVNGAKI 
TAPTGFTQGN QVPMNSNTFK YTAAKALPAT YTTGGKVYTF QGWYKGKTKP STLNKTTTPT 
FNATFDGNDD MTAMYKEEIP TASVTLTRPK EVIDTNTNVI WTTTITNTSK APLQNLTLKK 
GPNWSAGLTI PTFMEVTPEG ETTKSIPVNS TLWTEGVPLP NAVPIGKKVS VAFTTRATGK 
PNTVLKAEW VFGGIKDSTV DNFVRIRPND QEWTPTTEG FISVPTFDFG QVGVAGTKQQ 
HSLKQAADYY GNGTRNPYLR IKKTQPNWSL TAQLSQPKSA TDSLPTATRL LLGAAPVSSF 
TISTYNQPTELK NTVGTTSAIS LTANNTATSI lANKQFTGSN VYQLDFTFNN VKLEVPANQG 
VKGQQYKAAV TWNLVT 



EF109-1 (SEQ ID NO:421) 
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AGGAGTAAAT TAATGAAAAA AAGTGTTATA 
GGATTTCTCG TTACCCCTAT TTCTGCTTAC 
GAAACGGTGG CTTCAGAAAC ATCTCTAACG 
GAAATGAACC CAAGCATCAT AAATTCTCAA 
ACCTCCGATT CCACCACa?GA AGTTTCTACA 
NATAGTAGCG ACGTACTGAA ACTACTTTGG 
TAG 



ACTAGTTCTA TGTTAGCGGT TTTGTTGTCG 
GCTTTGGAAC GCTCTAAGGG AACTACTGAA 
GAGCGACAAA TGAGTAGCGG TGTCACTGAA 
GAGGAAACAG AAACAACGTC CACTTCCTCA 
TCAGAAGTAA CAACTGTTAA TGATACAGAA 
NAACATCACN AAGTAATGAG GACACACCTA 



EF109-2 (SEQ ID NO:422) 

MKKSVI TSSMLAVLLS GFLVTPISAY ALERSKGTTE ETVASETSLT ERQMSSGVTE 
EMNPSIINSQ EETETTSTSS TSDSTTEVST SEVTTVNDTE XSSDVLKLLW XHHXVMRTHL 

EF109-3 (SEQ ID NO:423) 

GGAAC GCTCTAAGGG AACTACTGAA 

GAAACGGTGG CTTCAGAAAC ATCTCTAACG GAGCGACAAA TGAGTAGCGG TGTCACTGAA 
GAAATGAACC CAAGCATCAT AAATTCTCAA GAGGAAACAG AAACAACGTC CACTTCCTCA 
ACCTCCGATT CCACCACTGA AGTTTCTACA TCAG 

EF109-4 (SEQ ID N0:424) 

ERSKGTTE ETVASETSLT ERQMSSGVTE EMNPSIINSQ EETETTSTSS TSDSTTEVST S 
EFllO-1 (SEQ ID NO:425) 

TAAATAAAAA TGGATAAGGA GTGGCATAAT CTTATGAAAA AGTTCTCCAT ACGAAAAATT 
AGTGCTGGTT TTTTG'JTTCT GATTTTAGTA ACTTTGATCG CCGGTTTTAG CTTGTCTGCA 
AATGCAGAAG AGTATATCGT TCCTGCCGAA AGTCATTCAC GACAAAAAAG ATCGTTACTG 
GACCCTGAGG ACAGAAGACA AGAAGTGGCA GATACAACCG AAGCGCCTTT TGCGTCAATC 
GGAAGAATCA TTTCCCCTGC CAGTAAACCA GGCTATATTT CTTTAGGAAC AGGCTTTGTT 
GTTGGAACCA ATACAATTGT CACCAATAAT CATGTGGCTG AAAGTTTTAA GAATGCCAAA 
GTATTAAATC CGAATGCCAA AGATGATGCT TGGTTTTATC CAGGTCGAGA TGGCAGTGCG 
ACACCATTTG GCAAATTCAA AGTGATTGAT GTAGCTTTTT CCCCGAATGC GGATATTGCG 
GTAGTGACTG TCGGCAAACA AAACGATCGT CCAGATGGCC CAGAGTTGGG AGAAATTTTA 
ACGCCATTTG TTTTGAAAAA GTTTGAATCT TCAGATACCC ATGTCACAAT ATCAGGCTAT 
CCAGGTGAGA AAAACCACAC ACAATGGTCT CATGAAAATG ATTTGTTTAC ATCTAACTTT 
ACAGACTTAG AAAATCCATT ACTATTTTAT GATATCGATA CAACCGGCGG TCAATCTGGT 
TCACCAATCT ATAATGATCA GGTTGAAGTA GTTGGTGTTC ATTCCAATGG CGGCATTAAG 
CAAACAGGAA ATCATGGTCA AAGACTAAAT GAAGTGAATT ATAACTTTAT TGTTAATCGA 
GTGAATGAAG AAGAAAATAA ACGTTTATCC GCTGTGCCAG CAGCGTAA 

EFllO-2 (SEQ ID NO:426) 

MKKFSIRKIS AGFLFLILVT LIAGFSLSAN AEEYIVPAES HSRQKRSLLD 
PEDRRQEVAD TTEAPFASIG RIISPASKPG YISLGTGFW GTNTIVTNNH VAESFKNAKV 
LNPNAKDDAW FYPGRDGSAT PFGKFKVIDV AFSPNADIAV VTVGKQNDRP DGPELGEILT 
PFVIiKKFESS DTHVTISGYP GEKNHTQWSH ENDLFTSNFT DLENPLLFYD IDTTGGQSGS 
PIYNDQVEW GVHSNGGIKQ TGNHGQRLNE VNYNFIVNRV NEEENKRLSA VPAA 

EFllO-3 (SEQ ID NO: 427) 

AG AGTATATCGT TCCTGCCGAA AGTCATTCAC GACAAAAAAG ATCGTTACTG 
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GACCCTGAGG ACAGAAGACA AGAAGTGGCA GATACAACCG AAGCGCCTTT TGCGTCAATC 
GGAAGAATCA TTTCCCCTGC CAGTAAACCA GGCTATATTT CTTTAGGAAC AGGCTTTGTT 
GTTGGAACCA ATACAATTGT CACCAATAAT CATGTGGCTG AAAGTTTTAA GAATGCCAAA 
GTATTAAATC CGAATGCCAA AGATGATGCT TGGTTTTATC CAGGTCGAGA TGGCAGTGCG 
ACACCATTTG GCAAATTCAA AGTGATTGAT GTAGCTTTTT GCCCGAATGC GGATATTGCG 
GTAGTGACTG TCGGCAAACA AAACGATCGT CCAGATGGCC CAGAGTTGGG AGAAATTTTA 
ACGCCATTTG TTTTGAAAAA GTTTGAATCT TCAGATACCC ATGTCACAAT ATCAGGCTAT 
CCAGGTGAGA AAAACCACAC ACAATGGTCT CATGAAAATG ATTTGTTTAC ATCTAACTTT 
ACAGACTTAG AAAATCCATT ACTATTTTAT GATATCGATA CAACCGGCGG TCAATCTGGT 
TCACCAATCT ATAATGATCA GGTTGAAGTA GTTGGTGTTC ATTCCAATGG CGGCATTAAG 
CAAACAGGAA ATCATGGTCA AAGACTAAAT GAAGTGAATT ATAACTTTAT TGTTAATCGA 
GTGAATGAAG AAGAAAATAA ACGTTTATCC GCTGTGCCAG CAGCGT 



EFllO-4 (SEQ ID NO:428) 

EYIVPAES HSRQKRSLLD 
PEDRRQEVAD TTEAPFASIG RIISPASKPG 
LNPNAKDDAW FYPGRDGSAT PFGKFKVIDV 
PFVLKKFESS DTHVTISGYP GEKNHTQWSH 
PIYNDQVEW GVHSNGGIKQ TGNHGQRLNE 

EFlll-1 (SEQ ID NO:429) 



YISLGTGFW GTNTIVTNNH VAESFKNAKV 
AFSPNADIAV VTVGKQNDRP DGPELGEILT 
ENDLFTSNFT DLENPLLFYD IDTTGGQSGS 
VNYNFIVNRV NEEENKRLSA VPAA 



TGATCAATAC ACTTCGATAC GGTCGCTTTT TTTCTAGAGA AAGTTGAATC TTTCAATAAT 
AAAAAGGGAT ACACTCCATT TGGCATAGTC CTTGCTGATA ATAAATCAGT GTATAAAGCG 
CTATCATTTT ATAGGAGGGG TTTTATGAAG GGTTTATCAA AAAAGAAACG GGTGTCTACT 
TGGTTAGCGT TAGGAATCAC CGTAGTCAGC TGTTTTGCGT TAAGCAGGGA AGTGCAAGCA 
AGTGTTGAAA GAACAAAAGT TGATGAATTT GCAAATGTTT TAGATGTGAG TGCATCACCA 
ACCGAACGGA CGAATGGCGT ATACGATACC AATTATTTTA ATAATTTTTC TGATTTAGGT 
GCATGGCATG GCTACTATTT ACCTGAAAAA AGCAATAAAG AGCTACTGGG TGGTTTTGCG 
GGGCCATTGA TTATTGCGGA AGAATATCCA GTAAACTTGG CGGCAAGTTT AAACAAATTA 
ACGGTCAAAA ATAAAAAAAC GGGAGAAACC TATGATTTAA GCCAAAGCAA CCGCATGGAC 
CTGTCTTATT ATCCTGGGCG CCTAGAGCAA ACCTATGAAT TAGACGATTT AACGATTCAT 
TTAGCTTTAA TTTTTGTCAG CAATCGAACG GCGCTTATCC AAACGACACT TGAAAACACT 
GGTGAAGAGC CCTTGTCACT TGGAGCAAGC TGGACAGGTG CGGTCTTTGA CAAAATTCAA 
GAGGGAACGG AAACCTTAGA TATTGGCACT CGTTTAACTG CTAAAGACAA TGACATTCAA 
GTGAATTTTG GTGAAGTCAG AGAAACGOXSG AATTATTTTG CTACGAAAGA CACAAAATAT 
ACGATTCATC ATGCGGATAA AGTTTCAACA AAAATTGATA ATCGGAATTA TACAGCAACC 
GCTGAACCAA TTGAATTGAA GCCTAAACAA ACGTACAACA CCTATACGAC AGAAAGCTAT 
ACTTTTACAA AAGAAGAAGA GGCAAAGGAA CAACAACAAG CACCCGAATA TACCAAAAAT 
GCGGCGCGCT ATTTCAAAGA GAACAAGCAA AGATGGCAAG GATATCTAGA TAAAACGTTT 
GATCAAAAGA AAACAGCAGA ATTTCCTGAA TATCAAAATG CGCTAGTCAA ATGGATTGAA 
ACGATTAATA CCAATTGGCG AAGTGGGGCA GGTGCCTTTA AGCATGACGG GATTGTTCCG 
TCCATGTCTT ATAAATGGTT TATTGGTATG TGGGCTTGGG ATTCGTGGAA AGCGGATGTA 
GCAACGGCTG ATTTTAATCC TGAGTTAGCT AAAAATAATA TGCGGGCCTT GTTTGATTAT 
CAAATTCAAA AAGATGATAC CGTACGTCCA CAAGATGCAG GAGCGATCAT TGATGCTGTC 
TTTTACAATC AAGACAGTGC GCGTGGTGGT GAAGGTGGCA ACTGGAATGA ACGAAATTCT 
AAACCACCAT TGGCTGCATG GGCAGTTTGG CATATTTATC AAGAAACCAA AGATAAGGAA 
TTTTTAAAAG AAATGTATCC CAAACTTGTG GCTTATCATA ATTGGTGGTA TACCAACAGA 
GACCACAATA AAAATGGGAT AGCAGAATAT GGAAGCATGG TCAGTGATGC TCACTGGCAA 
AAAGACGACA AGGATCAAAT CATTAAAGAT AAAAATGGCC ACCTAAAGTG GATGATGATG 
CTGTTATTGA AGCAGCCGCG TGGGAAAGTG GCATGGATAA CGCTACACGG TTTGACAAAG 
AAGGTGTGGG CAAAGGCGAC GTTGGAGTTA AAGTTTTTGA AAACAAAAAT AAAGGAAAAG 
TAG 
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EFlll-2 (SEQ ID NO:430) 
MKG LSKKKRVSTW 

LALGITWSC FALSREVQAS VERTKVDEFA NVLDVSASPT ERTNGVYDTN YFNNFSDLGA 
WHGYYLPEKS NKELLGGFAG PLIIAEEYPV NLAASLNKLT VKNKKTGETY DLSQSNRMDL 
SYYPGRLEQT YELDDLTIHL ALIFVSNRTA LIQTTLENTG EEPLSLGASW TGAVFDKIQE 
GTETLDIGTR LTAKDNDIQV NFGEVRETWN YFATKDTKYT IHHADKVSTK IDNRNYTATA 
EPIELKPKQT YNTYTTESYT FTKEEEAKEQ QQAPEYTKNA ARYFKENKQR WQGYLDKTFD 
QKKTAEFPEY QNALVKSIET INTNWRSAAG AFKHDGIVPS MSYKWFIGMW AWDSWKADVA 
TADFNPELAK NNMRALFDYQ IQKDDTVRPQ DAGAIIDAVF YNQDSARGGE GGNWNERNSK 
PPLAAWAVWH lYQETKDKEF LKEMYPKLVA YHNWWYTNRD HNKNGIAEYG SMVSDAHWQK 
DDKDQIIKDK NGHLKWMMML LLKQPRGKVA WITLHGLTKK VWAKATLELK FLKTKIKEK 

EFlll-3 (SEQ ID NO:431) 

TGATGAATTT GCAAATGTTT TAGATGTGAG TGCATCACCA 

ACCGAACGGA CGAATGGCGT ATACGATACC AATTATTTTA ATAATTTTTC TGATTTAGGT 
GCATGGCATG GCTACTATTT ACCTGAAAAA AGCAATAAAG AGCTACTGGG TGGTTTTGCG 
GGGCCATTGA TTATTGCGGA AGAATATCCA GTAAACTTGG CGGCAAGTTT AAACAAATTA 
ACGGTCAAAA ATAAAAAAAC GGGAGAAACC TATGATTTAA GCCAAAGCAA CCGCATCGAC 
CTGTCTTATT ATCCTGGGCG CCTAGAGCAA ACCTATGAAT TAGACGATTT AACGATTCAT 
TTAGCTTTAA TTTTTGTCAG CAATCGAACG GCGCTTATCC AAACGACACT TGAAAACACT 
GGTGAAGAGC CCTTGTCACT TGGAGCAAGC TGGACAGGTG CGGTCTTTGA CAAAATTCAA 
GAGGGAACGG AAACCTTAGA TATTGGCACT CGTTTAACTG CTAAAGACAA TGACATTCAA 
GTGAATTTTG GTGAAGTCAG AGAAACGTGG AATTATTTTG CTACGAAAGA CACAAAATAT 
ACGATTCATC ATGCGGATAA AGTTTCAACA AAAATTGATA ATCGGAATTA TACAGCAACC 
GCTGAACCAA TTGAATTGAA GCCTAAACAA ACGTACAACA CCTATACGAC AGAAAGCTAT 
ACTTTTACAA AAGAAGAAGA GGCAAAGGAA CAACAACAAG CACCCGAATA TACCAAAAAT 
GCGGCGCGCT ATTTCAAAGA GAACAAGCAA AGATGGCAAG GATATCTAGA TAAAACGTTT 
GATCAAAAGA AAACAGCAGA ATTTCCTGAA TATCAAAATG CGCTAGTCAA ATCGATTGAA 
ACGATTAATA CCAATTGGCG AAGTGCGGCA GGTGCCTTTA AGCATGACGG GATTGTTCCG 
TCCATGTCTT ATAAATGGTT TATTGGTATG TGGGCTTGGG ATTCGTGGAA AGCGGATGTA 
GCAACGGCTG ATTTTAATCC TGAGTTAGCT AAAAATAATA TGCGGGCCTT GTTTGATTAT 
CAAATTCAAA AAGATGATAC CGTACGTCCA CAAGATGCAG GAGCGATCAT TGATGCTGTC 
TTTTACAATC AAGACAGTGC GCGTGGTGGT GAAGGTGGCA ACTGGAATGA ACGAAATTCT 
AAACCACCAT TGGCTGCATG GGCAGTTTGG CATATTTATC AAGAAACCAA AGATAAGGAA 
TTTTTAAAAG AAATGTATCC CAAACTTGTG GCTTATCATA ATTGGTGGTA TACCAACAGA 
GACCACAATA AAAATGGGAT AGCAGAATAT GGAAGCATGG TCAGTGATGC TCACTGGCAA 
AAAGACGACA AGGATCAAAT CATTAAAGAT AAAAATGGCC ACCTAAAGTG GATGATGATG 
CTGTTATTGA AGCAGCCGCG TGGGAAAGTG GCATGGATAA CGCTACACGG TTTGACAAAG 
AAGGTGTGGG CAAAGGCGAC GTTGGAGTTA AAGTT 

EFlll-4 (SEQ ID NO:432) 

DEFA NVLDVSASPT ERTNGVYDTN YFNNFSDLGA 

WHGYYLPEKS NKELLGGFAG PLIIAEEYPV NLAASLNKLT VKNKKTGETY DLSQSNRMDL 
SYYPGRLEQT YELDDLTIHL ALIFVSNRTA LIQTTLENTG EEPLSLGASW TGAVFDKIQE 
GTETLDIGTR LTAKDNDIQV NFGEVRETWN YFATKDTKYT IHHADKVSTK IDNRNYTATA 
EPIELKPKQT YNTYTTESYT FTKEEEAKEQ QQAPEYTKNA ARYFKENKQR WQGYLDKTFD 
QKKTAEFPEY QNALVKSIET INTNWRSAAG AFKHDGIVPS MSYKWFIGMW AWDSWKADVA 
TADFNPELAK NNMRALFDYQ IQKDDTVRPQ DAGAIIDAVF YNQDSARGGE GGNWNERNSK 
PPLAAWAVWH lYQETKDKEF LKEMYPKLVA YHNWWYTNRD HNKNGIAEYG SMVSDAHWQK 
DDKDQIIKDK NGHLKWMMML LLKQPRGKVA WITLHGLTKK VWAKATLELK 
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EF117-1 (SEQ ID NO:433) 

TAATTCGATG GAGAAGGTGG TTTAGTGAAA AGATTTTCAT TTTTTTTACT AATTTTACTT 
GCTTTAACAG GTTGTAAATC CGGTGAAAAA GAATTTGATG AAGAATCTCT TCAAAATCTA 
AAGGAAACGN CACAGTCTTA NTCAGAAACA GAATTACAAA ATGGTGACGT TCGTTTAAAT 
GAATATATTT CTTTGAAAGG GGAGATTGTT GAGAGTGACA GTCGTTCCAG TTTAATAAAA 
AAAGGTGATC GTTTTATTTT GAAAAGTGGT TCTAGTAAAT ATCAAGTTTN TAATGAGCAA 
AAGAAAAAAT TGAAGATTGG TGACGAAGTG ACAGTTTACG GAGAATATTA CGGCTTTTTG 
AAAGGGACAT TAATTGAAAG TGAGGAGAAT CATGATTCAG CCACGAATTA G 

EF117-2 (SEQ ID NO:434) 

VKR FSFFLLILLiA LTGCKSGEKE FDEESLQNLK ETXQSXSETE LQNGDVRLNE 
YISLKGEIVE SDSRSSLIKK GDRFILKSGS SKYQVXNEQK KKLKIGDEVT VYGEYYGFLK 
GTLIESEENH DSATN 

EF117-3 (SEQ ID NO:435) 

TG AAGAATCTCT TCAAAATCTA 

AAGGAAACGN CACAGTCTTA NTCAGAAACA GAATTACAAA ATGGTGACGT TCGTTTAAAT 
GAATATATTT CTTTGAAAGG GGAGATTGTT GAGAGTGACA GTCGTTCCAG TTTAATAAAA 
AAAGGTGATC GTTTTATTTT GAAAAGTGGT TCTAGTAAAT ATCAAGTTTN TAATGAGCAA 
AAGAAAAAAT TGAAGATTGG TGACGAAGTG ACAGTTTACG GAGAATATTA CGGCTTTTTG 
AAAGGGACAT TAATTGAAAG TGAGGAGAAT CATGATTCAG CCACGAA 

EF117-4 (SEQ ID NO: 436) 

EESLQNLK ETXQSXSETE LQNGDVRLNE YISLKGEIVE SDSRSSLIKK GDRFILKSGS 
SKYQVXNEQK KKLKIGDEVT VYGEYYGFLK GTLIESEENH DSATN 

EF118-1 (SEQ ID NO:437) 

TGAGGGGGAA AAAGTGTGTT AAAAAGAAAA GTGGGGATTG TCGCAGGCGT TTTCTGTTCA 
GCTTTGTTAC TGACAGGTTG TGGCAAAAGT GCGAAAGATG AGTTCATTCA AGGAATCGGC 
AATCANAACG CACAAGAATC TGGGGTTTGN GATTTCTCTA TGTCAATTAG TGACATGAAA 
TTTTCACAAG AAGATGGTGC ACAAACGAAT CCTATGATTG GGATGCTCAT CACGCAAATC 
AAAGACGCAT CGCTTTCTGG GGAAGATTCA AGTAGATGCC AAAAAAGAAA AAGCATTCAA 
CTTAGAGATG AAATTAAAAG CGATGGGAAT GGATGTACCG ATTTCATTGG TTGGATCGTT 
AGATAA 

EF118-2 (SEQ ID NO:438) 

VLKRKV GIVAGVFCSA LLLTGCGKSA KDEFIQGIGN XNAQESGVXD FSMSISDMKF 
SQEDGAQTNP MIGMLITQIK DASLSGEDSS RCQKRKSIQL RDEIKSDGNG CTDFIGWIVR 

EF118-3 (SEQ ID NO:439) 

GAAAGATG AGTTCATTCA AGGAATCGGC 

AATCANAACG CACAAGAATC TGGGGTTTGN GATTTCTCTA TGTCAATTAG TGACATGAAA 

TTTTCACAAG AAGATGGTGC ACAAACGAAT CCTATGATTG GGAT<3CTCAT CACGCAAATC 

AAAGACGCAT CGCTTTCTGG GGAAGATTCA AGTAGATGCC AAAAAAGAAA AAGCATTCAA 

CTTAGAGATG AAATTAAAAG CGATGGGAAT GGATGTACCG ATTTCATTGG TTGGATCGTT 
AGAT 
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EF118-4 (SEQ ID NO:440) 

KDEFIQGIGN XNAQESGVXD FSMSISDMKF SQEDGAQTNP MIGMLITQIK DASLSGEDSS 
RCQKRKSIQL RDEIKSDGNG CTDFIGWIVR 

EF119-1 (SEQ ID NO:441) 

TAAAGAATAC CGAGTAAAAT TTTCGGAAGG CTTTTTTTCA AAAATTGTAT ATGCAAAAGA 
AGTGCAACGG AAAGGAGCTC GGAAATCGTG AATAAGCTAC CTTTACTTAT TTTATTGTTA 
GGCGGAGTGT TGCTTGTTAG TGGCTGTCAA AGCCATAAGG AAGAAAACAA GTCTAGTAAA 
GTATCGACAG AAGAAACGAC AGTGATTGAA ACAGTAGCAA GGGAACAATC GAAGGAATCG 
TTTACGAGTG AAGCAACTAA AAAACAGACA GAAACAACGA AATTAGAAGA ACCAGATCAT 
GTAAAACTTC TAGAAGCTTA TGGAAATGCG TATGCGAACT TTACAAGTAT TAATGATCGC 
AATGAAAAGC TAAAGCCCCT CATGACTGAA AAATGTATCA AAAAAAATGG AATTGATGTT 
AAAACTGGAG TAGCGTTAGT TTCCGTAGGA AAGGTTACAA CGATTTATAA AAATGATCAA 
CATGAATATG CTTTACTTTT GGATTGTGAA CAAAATGGAA CGCAGACACG AGTGTTACTT 
TTGGCTAAGG TGAAGAACAA TAAAATTTCT GAAATGACCT ATAATTCAGT TAAGCAAGAG 
TATTAG 



EF119-2 (SEQ ID NO:442) 

VN KLPLLILLLG GVLLVSGCQS HKEENKSSKV STEETTVIET VAREQSKESF TSEATKKQTE 
TTKLEEPDHV KLLEAYGNAY ANFTSINDRN EKLKPLMTEK CIKKNGIDVK TGVALVSVGK 
VTTIYKNDQH EYALLLDCEQ NGTQTRVLLL AKVKNNKISE MTYNSVKQEY 

EF119-3 (SEQ ID NO:443) 

AGAAAACAA GTCTAGTAAA 

GTATCGACAG AAGAAACGAC AGTGATTGAA ACAGTAGCAA GGGAACAATC GAAGGAATCG 
TTTACGAGTG AAGCAACTAA AAAACAGACA GAAACAACGA AATTAGAAGA ACCAGATCAT 
GTAAAACTTC TAGAAGCTTA TGGAAATGCG TATGCGAACT TTACAAGTAT TAATGATCGC 
AATGAAAAGC TAAAGCCCCT CATGACTGAA AAATGTATCA AAAAAAATGG AATTGATGTT 
AAAACTGGAG TAGCGTTAGT TTCCGTAGGA AAGGTTACAA CGATTTATAA AAATGATCAA 
CATGAATATG CTTTACTTTT GGATTGTGAA CAAAATGGAA CGCAGACACG AGTGTTACTT 
TTGGCTAAGG TGAAGAACAA TAAAATTTCT GAAATGACCT ATAATTCAGT TAAGCAAGAG 
TAT 

EF119-4 (SEQ ID NO:444) 

ENKSSKV STEETTVIET VAREQSKESF TSEATKKQTE TTKLEEPDHV KLLEAYGNAY 
ANFTSINDRN 

EKLKPLMTEK CIKKNGIDVK TGVALVSVGK VTTIYKNDQH EYALLLDCEQ NGTQTRVLLL 
AKVKNNKISE MTYNSVKQEY 

EF120-1 (SEQ ID N0:445) . 

TGAATAGGCG TGAAAAAGGG AATGTTAGCG TTTTTTGTCG TGCTAGCGGT TTTATCATTA 
ACTGCTTGTC GGGAACCAAA AGNAAAGAAA GTAACCGCTT CAACGGAGGC ATCCTCTAAA 
GTTGAAGAGA CGAATGAAAA AACGAGTGAA ACAATTGATA AGACAAACGA ACAAGCGAGC 
AGCAGTGTCG AGTCTAACGA ATCAGTGAAA AATGAAGAGC CGACAGCTGA TGGAAACAAT 
AGTCAGCTAA CTGTAGCTGA TTTAGATACT ACAGCGATTA ATGCTGGCGA TTTTACTACT 
TTAGTTGGAA TATGGAAAAA TGGTAAAGGA GAGAGTTTGA TCATTCATCC TGATGGTAGT 
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ACAAATACCG GAGGAATGAT TACGAAGGAT TCACCTACTG ATGAGTCGCG ACCAATTACA 
AGCTTAAGTA TTAGGTGGGG GCCTACTGGT GCTGCGCTAT TATTATATAA AATTGGTGTT 

EF120-2 (SEQ ID NO:446) 

VKKGMLAF FWLAVLSLT ACREPKXKKV TASTEASSKV EETNEKTSET IDKTNEQASS 
SVESNESVKN EEPTADGNNS QLTVADLDTT AINAGDFTTL VGIWKNGKGE SLIIHPDGST 
NTGGMITKDS PTDESRPITS LSIRWGPTGA ALLLYKIGV 

EF120-3 (SEQ ID NO:447) 

AAGAAA GTAACCGCTT CAACGGAGGC ATCCTCTAAA 

GTTGAAGAGA CGAATGAAAA AACGAGTGAA ACAATTGATA AGACAAACGA ACAAGCGAGC 
AGCAGTGTCG AGTCTAACGA ATCAGTGAAA AATGAAGAGC CGACAGCTGA TGGAAACAAT 
AGTCAGCTAA CTGTAGCTGA TTTAGATACT ACAGCGATTA ATGCTGGCGA TTTTACTACT 
TTAGTTGGAA TATGGAAAAA TGGTAAAGGA GAGAGTTTGA TCATTCATCC TGATGGTAGT 
ACAAATACCG GAGGAATGAT TACGAAGGAT TCACCTACTG ATGAGTCGCG ACCAATTACA 
AGCTTAAGTA TTAGGTGGGG GCCTACTGGT GCTGCGCTAT TATTATATAA AATTGGTGTT 

EF120-4 (SEQ ID NO:448) 

KKV TASTEASSKV EETNEKTSET IDKTNEQASS 

SVESNESVKN EEPTADGNNS QLTVADLDTT AINAGDFTTL VGIWKNGKGE SLIIHPDGST 
NTGGMITKDS PTDESRPITS LSIRWGPTGA ALLLYKIGV 



EF121-1 (SEQ ID NO:449) 

TGAAACACAA GGAGGAAATT TGTGAAAAAG 
CATTTTTTAA TGGCTGTTGC GTTGATAGCG 
GAAACAACGA GTCAACAAAG TTCAGAAGCA 
CAAGAACCAG TCATTACACA GGAAACAACA 
ACGAGTGACA GTGTCAAGCA GTCACAAGAA 
GAAACGTCAA TCGCTGAAAA AGAAGAAACG 
ACGTCAGATG TTCATGGTCA ATTATGGAAT 
GTTGGTTTGT CCCAAGTAAG TACAGTCGTT 
ACCGTTTTAA TTGATAATGG CGACAATATT 
AATAAAGCGC CTTTAGTGAA TGAAAAGACC 
AAGTATGATG CAATGGTTTT GGGAAATCAT 
AAAATTCAAC AAGAAGCCAC TTTTCCAATC 
GGTCTTCGTT TTGTTGAAGG GACTACCACG 
CCAGATTTAA AAGTTGGGAT TATCGGCTTA 
CCTCGTGTTA CTTCGCTTAA TTTTTTACCT 
GAGTTGAAAG CTAACGATCA GGCTGACATT 
AATAGTGATC CGGCTGCCAG TGCCGACCAA 
TATATTCTGG GTCATGACCA CCTTTCTTTT 
ACTGTACCGG TAGGGGGACC GAAAGATACG 
GTTGCTAAAA ATGCCGATAA GTGGGAAGTG 
ACGAATGTTC CAGCAGATGA AGCAGTTAAG 
CGAGCGTTTA TTCAGGAGGA GATCGGCACA 
ATTAAAGGAA TTCCCGAAGC ACAATTACAA 
GTTCAAAAAG AAGTAACGGG CGCACAATTA 
AAATTACCTG CGGGGAAGAT TTCCTATGCC 
ACCTTAGTGA GTGTTCCCAT TAACGGTGAA 



TTGAGCTTTA AAAAAGTGAA GTGGGGCATG 
CCAAGTGTTA CTAGTACGGC ATATGCAGTA 
GTAACAAGTA CCACCGATTC AAGTAGAAAA 
GACATCAAAC AAGAAGCACC AAATCAGGCT 
ACCACAGCAC CAACAGAGAC GACGAATTTA 
AGCACGCCGC AAAAAATAAC AATTTTAGGT 
TGGTCTTATG AAGATGATAA AGAACTACCA 
AACCAAGTCC GGGCACAAAA CCCAGCAGGC 
CAAGGCACTA TTTTAACAGA TGACTTGTAT 
CATCCAATGA TCACCGCCAT GAATGTGATG 
GAGTTTAATT TTGGTTTACC GTTAATCAAA 
TTGTCTGCGA ATACCTACAA TAAGGAAGAT 
AAGGAACTTG ATTTTAATCA AGATGGGCAG 
ACAATTCCGC ACATTCCTTT GTGGGATGGC 
TTGAAAGAAG AAGCAGAAAA AGCAGTTACT 
ATTGTTGCCT CGATTCATGC GGGACAACAA 
GTAATTGAAA ATGTCGCGGG GATTGATGCG 
ACCAAGCAAG GAGCAGCGCC GAATGGAAAA 
GGGACAGAAG TTGTCAAAAT TGATCTTTCA 
CAAGAAGGTA CAGCAACGAT TGTACCAACA 
GCAGCGACAA AAGAATACCA TGAAAAAACG 
GCAACAGCTG ATTTTTTACC AAAACAAGAA 
CCAACAGCGA TGATTTCTTT AATTAATAAC 
AGTGCGGCAG CGCTGTTTAA ATACGACAGT 
ACGATTTTTG ATATCTACAA ATACCCGAAT 
AACTTACTGA AGTATTTAGA AAAACAAGGG 



wo 98/50554 



PCT/US98/08959 



218 

TABLE 1 . Nucleotide and Amino Acid Seqeuences of E, faecaiis Genes. 

GCGTACTATA ACCAAACACA GCCAGATGAT TTGACCATTA GTTTTAATCC AAACATTCGT 
GTATATAACT ATGACATGAT TTCTGGAGTG GACTACAAGA TTGACATTTC AAAACCAGTG 
GGTGAACGAA TTGTAGATGC GAAAATTGAC GGCCAACCGC TGGATCCTGC CAAAGAATAT 
ACGATTGCTA TGAATAATTA TCGTTACGGC GGTTTAGCTA GCCAAGGGAT TCAAGTAGGG 
GAACCTATTA AAAATTCTGA TCCAGAAACC TTACGAGGAA TGATTGTTGA TTATATTAAG 
AAAAAAGGAA CTCTTGATCC AGAACAAGAA ATCGAACGAA ATTGGTCAAT TATTGGGACA 
AATTTTGATG AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA 
ATTCCGACTT CTCCTGATGG ACGTACACCA AACGCCGCCG CTATTACGAA ACAAGATGTC 
CGTAATGCGG GCTTTGATTT AGATAATGCA TATACCATTA TGCACACAAA TGACGTTCAT 
GGCCGACTAG AAGCAGGGAA AGGCGAATTA GGTATGGCGC GTCTAAAAAC CTTTAAAGAC 
CAAGAAAACC CAACCTTGAT GGTGGATGCA GGGGATGTTT TCCAAGGATT ACCAATCTCC 
AATTTCTCCA AAGGCGCGGA TATGGCCAAA GCAATGAATG AAGTTGGTTA TGATGCCATG 
GCGGTGGGAA ATCACGAGTT TGATTTTGGT TTAGAGATTG CACTAGGTTA TAAAGACCAA 
CTGAATTTTC CGATTTTATC TAGTAATACG TATTACAAAG ATGGCAGTGG ACGGGTTTTT 
GATCCGTATA CAATCGTAGA AAAATCCGGG AAAAAGTTTG CCATTGTAGG TGTGACGACC 
CCAGAAACAG CAACGAAAAC ACACCCGAAA AACGTAGAGA AGGTGACATT TAAAGACCCG 
ATTCCAGAAG TAGAAGCAGT GATTAAGGAA ATTAAAGAGA AGTACGCGGA TATNCAAGCT 
TTCGTGGTTA CTGGGCATTT AGGCGTAGAT GAAACGACGC CGCATATCTG <3CGTGGTGAT 
ACGCTAGCAG AAACCCTTAG TCAAACATAT CCTGAGTTAG ATATCACTGT GATTGATGGA 
CATTCGCATA CAGCCGTCGA AAGTGGCAAA CGTTATGGCA AAGTGATCTA TGCTCAAACA 
GGTAATTATT TAAATAATGT TGGGATCGTC ACAGCACCAG AGAGTGAACC AACTAAGAAA 
ACAACAAAAT OXSATTTCAGC AGCAGAGCTG CTAGAATTGC CAGAAAACCC GGCAGTTAAA 
GCCATCGTTG ATGAAGCACG TACGAATTTT AACGCTGAAA ATGAAAAAGT AATTGTCGAT 
TATATTCCAT TCACATTGGA TGGACAACGA GAAAATGTGC GCACAGGAGA GACCAACTTA 
GGGAATTTGA TTGGTGATGC GATTATGTCA TATGGCCAAG ACGCGTTTAG CCAACCTGCT 
GATTTTGCAG TAACTAATGG TGGCGGCATT CGCGCTGATA TTAAACAAGG GCCAATTAAA 
GTTGGGGATG TCATTGCTGT GTTACCTTTT GGCAATAGCA TTGCGCAAAT TCAAGTAACC 
GGCGCCCAAG TTAAAGAAAT GTTTGAAATG TCTGTTCGTT CGATTCCACA AAAAGATGAG 
AATGGCACAA TTTTACTAGA TGATGCTGGC CAACCAAAAC TTGGCGCAAA TGGTGGTTTC 
CTACATGTTT CAAGCTCCAT TCGTATCCAC TATGATTCCA CAAAACCAGG TACTCGCTTG 
GCTAGTGACG AAGGCAATGA AACAGGACAA ACGATTGTCG GTAGTCGCGT ATTAGGAATA 
GAAATTAAAA ATCGGCAAAC ACAAAAGTTT GAACCATTGG ATGAGAAGAA ACAATACCGG 
ATGGCTACCA ATGATTTCTT AGCTGCTGGT GGTGATGGTT ACGATATGCT AGGTGGTGAA 
CGAGAAGAAG GGATTTCACT AGATTCTGTC TTAATTGAAT ACTTGAAAAG TGCAACCAGC 
TTGCGGTTGT ATCGTGCAGC AACGACGATT GATTTAGCAC AATATAAAGA ACCATTCCCA 
GGCGAACGAA TTGTTTCTAT TTCGGAAGAA GCTTACAAAG AGTTAATCGG TGGAGGAGAG 
ACGCCAAAAC CAGATCCAAA ACCAGACCCG AAACCAACAC CAGAAACACC AGTAGCAACC 
AATAAACAAA ACCAAGCGGG AGCAAGACAG AGCAATCCAT CCGTAACAGA GAAGAAAAAG 
TATGGCGGCT TTTTACCTAA AACGGGTACA GAAACAGAAA CGCTTGCATT ATATGGTTTA 
CTGTTCGTTG GACTTTCTTC TTCTGGCTGG TATATTTATA AACGACGTAA CAAAGCTAGT 
TAG 

EF121-2 (SEQ ID NO:450) 

VKKL SFKKVKWGMH FLMAVALIAP SVTSTAYAVE TTSQQSSEAV TSTTDSSRKQ 
EPVITQETTD IKQEAPNQAT SDSVKQSQET TAPTETTNLE TSIAEKEETS TPQKITILGT 
SDVHGQIiWNW SYEDDKELPV GLSQVSTWN QVRAQNPAGT VLIDNGDNIQ GTILTDDLYN 
KAPLVNEKTH PMITAMNVMK YDAMVLGNHE FNFGLPLIKK . IQQEATFPIL SANTYNKEDG 
LRFVEGTTTK ELDFNQDGQP DLKVGIIGLT IPHIPLWDGP RVTSLNFLPL KEEAEKAVTE 
LKANDQADII VASIHAGQQN SDPAASADQV lENVAGIDAY IU3HDHLSFT KQGAAPNGKT 
VPVGGPKDTG TEWKIDLSV AKNADKWEVQ EGTATIVPTT NVPADEAVKA ATKEYHEKTR 
AFIQEEIGTA TADFLPKQEI KGIPEAQLQP TAMISLINNV QKEVTGAQLS AAALFKYDSK 
LPAGKISYAT IFDIYKYPNT LVSVPINGEN LLKYLEKQGA YYNQTQPDDL TISFNPNIRV 
YNYDMISGVD YKIDISKPVG ERIVDAKIDG QPLDPAKEYT lAMNNYRYGG LASQGIQVGE 
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PIKNSDPETL RGMIVDYIKK KGTLDPEQEI ERNWSIIGTN FDEKWRAKAI ELVNDGTLQI 
PTSPDGRTPN AAAITKQDVR NAGFDLDNAY TIMHTNDVHG RLEAGKGELG MARLKTFKDQ 
ENPTLMVDAG DVFQGLPISN FSKGADMAKA MNEVGYDAMA VGNHEFDFGL EIALGYKDQL 
NFPILSSNTY YKDGSGRVFD PYTIVEKSGK KFAIVGVTTP ETATKTHPKN VEKVTFKDPI 
PEVEAVIKEI KEKYADXQAF WTGHLGVDE TTPHIWRGDT LAETLSQTYP ELDITVIDGH 
SHTAVESGKR YGKVIYAQTG NYLNNVGIVT APESEPTKKT TKLISAAELL ELPENPAVKA 
IVDEARTNFN AENEKVIVDY IPFTLDGQRE NVRTRETNLG NLIGDAIMSY GQDAFSQPAD 
FAVTNGGGIR ADIKQGPIKV GDVIAVLPFG NSIAQIQVTG AQVKEMFEMS VRSIPQKDEN 
GTILLDDAGQ PKLGANGGFL HVSSSIRIHY DSTKPGTRLA SDEGNETGQT IVGSRVLGIE 
IKNRQTQKFE PLDEKKQYRM ATNDFLAAGG DGYDMLGGER EEGISLDSVL lEYLKSATSL 
RLYRAATTID LAQYKEPFPG ERIVSISEEA YKELIGGGET PKPDPKPDPK PTPETPVATN 
KQNQAGARQS NPSVTEKKKY GGFLPKTGTE TETLALYGLL FVGLSSSGWY lYKRRNKAS 

EF121-3 (SEQ ID NO:451) 

ACAAAG TTCAGAAGCA GTAACAAGTA CCACCGATTC AAGTAGAAAA 

CAAGAACCAG TCATTACACA GGAAACAACA GACATCAAAC AAGAAGCACC AAATCAGGCT 
ACGAGTGACA GTGTCAAGCA GTCACAAGAA ACCACAGCAC CAACAGAGAC GACGAATTTA 
GAAACGTCAA TCGCTGAAAA AGAAGAAACG AGCAC<3CCGC AAAAAATAAC AATTTTAGGT 
ACGTCAGATG TTCATGGTCA ATTATGGAAT TGGTCTTATG AAGATCATAA AGAACTACCA 
GTTGGTTTGT CCCAAGTAAG TACAGTCGTT AACCAAGTCC GGGCACAAAA CCCAGCAGGC 
ACCGTTTTAA TTGATAATGG CGACAATATT CAAGGCACTA TTTTAACAGA TGACTTGTAT 
AATAAAGCGC CTTTAGTGAA TGAAAAGACC CATCCAATGA TCACCGCCAT <3AATGTGATG 
AAGTATGATG CAATGGTTTT GGGAAATCAT GAGTTTAATT TTGGTTTACC GTTAATCAAA 
AAAATTCAAC AAGAAGCCAC TTTTCCAATC TTGTCTGCGA ATACCTACAA TAAGGAAGAT 
GGTCTTCGTT TTGTTGAAGG GACTACCACG AAGGAACTTG ATTTTAATCA AGATGGGCAG 
CCAGATTTAA AAGTTGGGAT TATCGGCTTA ACAATTCCGC ACATTCCTTT GTGGGATGGC 
CCTCGTGTTA CTTCGCTTAA TTTTTTACCT TTGAAAGAAG AAGCAGAAAA AGCAGTTACT 
GAGTTGAAAG CTAACGATCA GGCTGACATT ATTGTTGCCT CGATTCATGC GGGACAACAA 
AATAGTGATC CGGCTGCCAG TGCCGACCAA GTAATTGAAA ATGTCGCGGG <3ATTGATGCG 
TATATTCTGG GTCATGACCA CCTTTCTTTT ACCAAGCAAG GAGCAGCGCC GAATGGAAAA 
ACTGTACCGG TAGGGGGACC GAAAGATACG GGGACAGAAG TTGTCAAAAT TGATCTTTCA 
GTTGCTAAAA ATGCCGATAA GTGGGAAGTG CAAGAAGGTA CAGCAACGAT TGTACCAACA 
ACGAATGTTC CAGCAGATGA AGCAGTTAAG GCAGCGACAA AAGAATACCA TGAAAAAACG 
CGAGCGTTTA TTCAGGAGGA GATCGGCACA GCAACAGCTG ATTTTTTACC AAAACAAGAA 
ATTAAAGGAA TTCCCGAAGC ACAATTACAA CCAACAGCGA TGATTTCTTT AATTAATAAC 
GTTCAAAAAG AAGTAACGGG CGCACAATTA AGTGCGGCAG CGCTGTTTAA ATACGACAGT 
AAATTACCTG CGGGGAAGAT TTCCTATGCC ACGATTTTTG ATATCTACAA ATACCCGAAT 
ACCTTAGTGA GTGTTCCCAT TAACGGTGAA AACTTACTGA AGTATTTAGA AAAACAAGGG 
GCGTACTATA ACCAAACACA GCCAGATGAT TTGACCATTA GTTTTAATCC AAACATTCGT 
GTATATAACT ATGACATGAT TTCTGGAGTG GACTACAAGA TTGACATTTC AAAACCAGTG 
GGTGAACGAA TTGTAGATGC GAAAATTGAC GGCCAACCGC TGGATCCTGC CAAAGAATAT 
ACGATTGCTA TGAATAATTA TCGTTACGGC GGTTTAGCTA GCCAAGGGAT TCAAGTAGGG 
GAACCTATTA AAAATTCTGA TCCAGAAACC TTACGAGGAA TGATTGTTGA TTATATTAAG 
AAAAAAGGAA CTCTTGATCC AGAACAAGAA ATCGAACGAA ATTGGTCAAT TATTGGGACA 
AATTTTGATG AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA 
ATTCCGACTT CTCCTGATGG ACGTACACCA AACGCCG 

EF121-4 (SEQ ID NO:452) 



QSSEAV TSTTDSSRKQ 

EPVITQETTD IKQEAPNQAT SDSVKQSQET 
SDVHGQLWNW SYEDDKELPV GLSQVSTWN 



TAPTETTNLE TSIAEKEETS TPQKITIU3T 
QVRAQNPAGT VLIDNGDNIQ XSTILTODLYN 
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KAPLVNEKTH PMITAMNVMK YDAMVLGNHE 
LRFVEGTTTK ELDFNQDGQP DLKVGIIGLT 
LKANDQADII VASIHAGQQN SDPAASADQV 
VPVGGPKDTG TEWKIDLSV AKNADKWEVQ 
AFIQEEIGTA TADFLPKQEI KGIPEAQLQP 
LPAGKISYAT IFDIYKYPNT LVSVPINGEN 
YNYDMISGVD YKIDISKPVG ERIVDAKIDG 
PIKNSDPETL RGMIVDYIKK KGTLDPEQEI 
PTSPDGRTPN A 



FNFGLPLIKK IQQEATFPIL SAOTTNKEDG 
IPHIPLWDGP RVTSLNFLPL KEEAEKAVTE 
lENVAGIDAY ILGHDHLSFT KQGAAPNGKT 
EGTATIVPTT NVPADEAVKA ATKEYHEKTR 
TAMISLINNV QKEVTGAQLS AAALFKYDSK 
LLKYLEKQGA YYNQTQPDDL TISFNPNIRV 
QPLDPAKEYT lAMNNYRYGG LASQGIQVGE 
ERNWSIIGTN FDEKWRAKAI ELVNDGTLQI 



EF122-1 (SEQ ID NO: 453) 

TGAAACACAA GGAGGAAATT TGTGAAAAAG TTGAGCTTTA AAAAAGTGAA GTGGGGCATG 
CATTTTTTAA TGGCTGTTGC GTTGATAGCG CCAAGTGTTA CTAGTACGGC ATATGCAGTA 
GAAACAACGA GTCAACAAAG TTCAGAAGCA GTAACAAGTA CCACCGATTC AAGTAGAAAA 
CAAGAACCAG TCATTACACA GGAAACAACA GACATCAAAC AAGAAGCACC AAATCAGGCT 
ACGAGTGACA GTGTCAAGCA GTCACAAGAA ACCACAGCAC CAACAGAGAC GACGAATTTA 
GAAACGTCAA TCGCTGAAAA AGAAGAAACG AGCACGCCGC AAAAAATAAC AATTTTAGGT 
ACGTCAGATG TTCATGGTCA ATTATGGAAT TGGTCTTATG AAGATGATAA AGAACTACCA 
GTTGGTTTGT CCCAAGTAAG TACAGTCGTT AACCAAGTCC GGGCACAAAA CCCAGCAGGC 
ACCGTTTTAA TTGATAATGG CGACAATATT CAAGGCACTA TTTTAACAGA TGACTTGTAT 
AATAAAGCGC CTTTAGTGAA TGAAAAGACC CATCCAATGA TCACCGCCAT GAATGTGATG 
AAGTATGATG CAATGGTTTT GGGAAATCAT GAGTTTAATT TTGGTTTACC GTTAATCAAA 
AAAATTCAAC AAGAAGCCAC TTTTCCAATC TTGTCTGCGA ATACCTACAA TAAGGAAGAT 
GGTCTTCGTT TTGTTGAAGG GACTACCACG AAGGAACTTG ATTTTAATCA AGATGGGCAG 
CCAGATTTAA AAGTTGGGAT TATCGGCTTA ACAATTCCGC ACATTCCTTT GTGGGATGGC 
CCTCGTGTTA CTTCGCTTAA TTTTTTACCT TTGAAAGAAG AAGCAGAAAA AGCAGTTACT 
GAGTTGAAAG CTAACGATCA GGCTGACATT ATTCTTGCCT CGATTCATGC GGGACAACAA 
AATAGTGATC CGGCTGCCAG TGCCGACCAA GTAATTGAAA ATGTCGCGGG GATTGATGCG 
TATATTCTGG GTCATGACCA CCTTTCTTTT ACCAAGCAAG GAGCAGCGCC GAATGGAAAA 
ACTGTACCGG TAGGGGGACC GAAAGATACG GGGACAGAAG TTGTCAAAAT TGATCTTTCA 
GTTGCTAAAA ATGCCGATAA GTGGGAAGTG CAAGAAGGTA CAGCAACGAT TGTACCAACA 
ACGAATGTTC CAGCAGATGA AGCAGTTAAG GCAGCGACAA AAGAATACCA TGAAAAAACG 
CGAGCGTTTA TTCAGGAGGA GATCGGCACA GCAACAGCTG ATTTTTTACC AAAACAAGAA 
ATTAAAGGAA TTCCCGAAGC ACAATTACAA CCAACAGCGA TGATTTCTTT AATTAATAAC 
GTTCAAAAAG AAGTAACGGG CGCACAATTA AGTGCGGCAG CGCTGTTTAA ATACGACAGT 
AAATTACCTG CGGGGAAGAT TTCCTATGCC ACGATTTTTG ATATCTACAA ATACCCGAAT 
ACCTTAGTGA GTGTTCCCAT TAACGGTGAA AACTTACTGA AGTATTTAGA AAAACAAGGG 
GCGTACTATA ACCAAACACA GCCAGATGAT TTGACCATTA GTTTTAATCC AAACATTCGT 
GTATATAACT ATGACATGAT TTCTGGAGTG GACTACAAGA TTGACATTTC AAAACCAGTG 
GGTGAACGAA TTGTAGATGC GAAAATTGAC GGCCAACCGC TGGATCCTGC CAAAGAATAT 
ACGATTGCTA TGAATAATTA TCGTTACGGC GGTTTAGCTA GCCAAGGGAT TCAAGTAGGG 
GAACCTATTA AAAATTCTGA TCCAGAAACC TTACGAGGAA TGATTGTTGA TTATATTAAG 
AAAAAAGGAA CTCTTGATCC AGAACAAGAA ATCGAACGAA ATTGGTCAAT TATTGGGACA 
AATTTTGATG AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA 
ATTCCGACTT CTCCTGATGG ACGTACACCA AACGCCGCCG CTATTACGAA ACAAGATGTC 
CGTAATGCGG GCTTTGATTT AGATAATGCA TATACCATTA TGCACACAAA TGACGTTCAT 
GGCCGACTAG AAGCAGGGAA AGGCGAATTA GGTATGGCGC GTCTAAAAAC CTTTAAAGAC 
CAAGAAAACC CAACCTTGAT GGTGGATGCA GGGGATGTTT TCCAAGGATT ACCAATCTCC 
AATTTCTCCA AAGGCGCGGA TATGGCCAAA GCAATGAATG AAGTTGGTTA TGATGCCATG 
GCGGTGGGAA ATCACGAGTT TGATTTTGGT TTAGAGATTG CACTAGGTTA TAAAGACCAA 
CTGAATTTTC CGATTTTATC TAGTAATACG TATTACAAAG ATGGCAGTGG ACGGGTTTTT 
GATCCGTATA CAATCGTAGA AAAATCCGGG AAAAAGTTTG CCATTGTAGG TGTGACGACC 
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CCAGAAACAG CAACGAAAAC ACACCCGAAA AACGTAGAGA AGGTGACATT TAAAGACCCG 
ATTCCAGAAG TAGAAGCAGT GATTAAGGAA ATTAAAGAGA AGTACGCGGA TATNCAAGCT 
TTCGTGGTTA CTGGGCATTT AGGCGTAGAT GAAACGACGC CGCATATCOXS GCGTGGTGAT 
ACGCTAGCAG AAACCCTTAG TCAAACATAT CCTGAGTTAG ATATCACTGT GATTGATGGA 
CATTCGCATA CAGCCGTCGA AAGTGGCAAA CGTTATGGCA AAGTGATCTA TGCTCAAACA 
GGTAATTATT TAAATAATGT TGGGATCGTC ACAGCACCAG AGAGTGAACC AACTAAGAAA 
ACAACAAAAT TGATTTCAGC AGCAGAGCTG CTAGAATTGC CAGAAAACCC GGCAGTTAAA 
GCCATCGTTG ATGAAGCACG TACGAATTTT AACGCTGAAA ATGAAAAAGT AATTGTCGAT 
TATATTCCAT TCACATTGGA TGGACAACGA GAAAATGTGC GCACACGAGA GACCAACTTA 
GGGAATTTGA TTGGTGATGC GATTATGTCA TATGGCCAAG ACGCGTTTAG CCAACCTGCT 
GATTTTGCAG TAACTAATGG TGGCGGCATT CGCGCTGATA TTAAACAAGG GCCAATTAAA 
GTTGGGGATG TCATTGCTGT GTTACCTTTT GGCAATAGCA TTGCGCAAAT TCAAGTAACC 
GGCGCCCAAG TTAAAGAAAT GTTTGAAATG TCTGTTCGTT CGATTCCACA AAAAGATGAG 
AATGGCACAA TTTTACTAGA TGATGCTGGC CAACGAAAAC TTGGCGCAAA TGGTGGTTTC 
CTACATGTTT CAAGCTCCAT TCGTATCCAC TATGATTCCA CAAAACCAGG TACTCGCTTG 
GCTAGTGACG AAGGCAATGA AACAGGACAA ACGATTGTCG GTAGTCGCGT ATTAGGAATA 
GAAATTAAAA ATCGGCAAAC ACAAAAGTTT GAACCATTGG ATGAGAAGAA ACAATACCGG 
ATGGCTACCA ATGATTTCTT AGCTGCTGGT GGTGATGGTT ACGATATGCT AGGTGGTGAA 
CGAGAAGAAG GGATTTCACT AGATTCTGTC TTAATTGAAT ACTTGAAAAG TGCAACCAGC 
TTGCGGTTGT ATCGTGCAGC AACGACGATT GATTTAGCAC AATATAAAGA ACCATTCCCA 
GGCGAACGAA TTGTTTCTAT TTCGGAAGAA GCTTACAAAG AGTTAATCGG TGGAGGAGAG 
ACGCCAAAAC CAGATCCAAA ACCAGACCCG AAACCAACAC CAGAAACACC AGTAGCAACC 
AATAAACAAA ACCAAGCGGG AGCAAGACAG AGCAATCCAT CCGTAACAGA GAAGAAAAAG 
TATGGCGGCT TTTTACCTAA AACGGGTACA GAAACAGAAA CGCTTGCATT ATATGGTTTA 
CTGTTCGTTG GACTTTCTTC TTCTGGCTGG TATATTTATA AACGACGTAA CAAAGCTAGT 
TAG 



EF122-2 (SEQ ID NO:454) 

VKKL SFKKVKWGMH FLMAVALIAP SVTSTAYAVE TTSQQSSEAV TSTTDSSRKQ 
EPVITQETTD IKQEAPNQAT SDSVKQSQET TAPTETTNLE TSIAEKEETS TPQKITILGT 
SDVHGQLWNW SYEDDKELPV GLSQVSTWN QVRAQNPAGT VLIDNGDNIQ GTILTDDLYN 
KAPLVNEKTH PMITAMNVMK YDAMVLGNHE FNFGLPLIKK IQQEATFPIL SANTYNKEDG 
LRFVEGTTTK ELDFNQDGQP DLKVGIIGLT IPHIPLWDGP RVTSLNFLPL KEEAEKAVTE 
LKANDQADII VASIHAGQQN SDPAASADQV lENVAGIDAY ILGHDHLSFT KQGAAPNGKT 
VPVGGPKDTG TEWKIDLSV AKNADPCWEVQ EGTATIVPTT NVPADEAVKA ATKEYHEKTR 
AFIQEEIGTA TADFLPKQEI KGIPEAQLQP TAMISLINNV QKEVTGAQLS AAALFKYDSK 
LPAGKISYAT IFDIYKYPNT LVSVPINGEN LLKYLEKQGA YYNQTQPDDL TISFNPNIRV 
YNYDMISGVD YKIDISKPVG ERIVDAKIDG QPLDPAKEYT lAMNNYRYGG LASQGIQVGE 
PIKNSDPETL RGMIVDYIKK KGTLDPEQEI ERNWSIIGTN FDEKWRAKAI ELVNDGTLQI 
PTSPDGRTPN AAAITKQDVR NAGFDLDNAY TIMHTNDVHG RLEAGKGELG MARLKTFKDQ 
ENPTLMVDAG DVFQGLPISN FSKGADMAKA MNEVGYDAMA VGNHEFDFGL EIAU3YKDQL 
NFPILSSNTY YKDGSGRVFD PYTIVEKSGK KFAIVGVTTP ETATKTHPKN VEKVTFKDPI 
PEVEAVIKEI KEKYADXQAF WTGHLGVDE TTPHIWRGDT LAETLSQTYP ELDITVIDGH 
SHTAVESGKR YGKVIYAQTG NYLNNVGIVT APESEPTKKT TKLISAAELL ELPENPAVKA 
IVDEARTNFN AENEKVIVDY IPFTLDGQRE NVRTRETNLG NLIGDAIMSY GQDAFSQPAD 
FAVTNGGGIR ADIKQGPIKV GDVIAVLPFG NSIAQIQVTG AQVKEMFEMS VRSIPQKDEN 
GTILLDDAGQ PKLGANGGFL HVSSSIRIHY DSTKPGTRLA SDEGNETGQT IVGSRVLGIE 
IKNRQTQKFE PLDEKKQYRM ATNDFLAAGG DGYDMLGGER EEGISLDSVL lEYLKSATSL 
RLYRAATTID LAQYKEPFPG ERIVSISEEA YKELIGGGET PKPDPKPDPK PTPETPVATN 
KQNQAGARQS NPSVTEKKKY GGFLPKTGTE TETLALYGLL FVGLSSSGWY lYKRRNKAS 



EF122-3 (SEQ ID NO:455) 
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TG AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA 
ATTCCGACTT CTCCTGATGG ACGTACACCA AACGCCGCCG CTATTACGAA ACAAGATGTC 
CGTAATGCGG GCTTTGATTT AGATAATGCA TATACCATTA TGCACACAAA TCACGTTCAT 
GGCCGACTAG AAGCAGGGAA AGGCGAATTA GGTATGGCGC GTCTAAAAAC CTTTAAAGAC 
CAAGAAAACC CAACCTTGAT GGTGGATGCA GGGGATGTTT TCCAAGGATT ACCAATCTCC 
AATTTCTCCA AAGGCGCGGA TATGGCCAAA GCAATGAATG AAGTTGGTTA TGATGCCATG 
GCGGTGGGAA ATCACGAGTT TGATTTTGGT TTAGAGATTG CACTAGGTTA TAAAGACCAA 
CTGAATTTTC CGATTTTATC TAGTAATACG TATTACAAAG ATGGCAGTGG ACGGGTTTTT 
GATCCGTATA CAATCGTAGA AAAATCCGGG AAAAAGTTTG CCATTGTAGG TGTGACGACC 
CCAGAAACAG CAACGAAAAC ACACCCGAAA AACGTAGAGA AGGTGACATT TAAAGACCCG 
ATTCCAGAAG TAGAAGCAGT GATTAAGGAA ATTAAAGAGA AGTACGCGGA TATNCAAGCT 
TTCGTGGTTA CTGGGCATTT AGGCGTAGAT GAAACGACGC CGCATATCTG GCGTGGTGAT 
ACGCTAGCAG AAACCCTTAG TCAAACATAT CCTGAGTTAG ATATCACTGT GATTGATGGA 
CATTCGCATA CAGCCGTCGA AAGTGGCAAA CGTTATGGCA AAGTGATCTA TGCTCAAACA 
GGTAATTATT TAAATAATGT TGGGATCGTC ACAGCACCAG AGAGTGAACC AACTAAGAAA 
ACAACAAAAT TGATTTCAGC AGCAGAGCTG CTAGAATTGC CAGAAAACCC GGCAGTTAAA 
GCCATCGTTG ATGAAGCACG TACGAATTTT AACGCTGAAA ATGAAAAAGT AATTGTCGAT 
TATATTCCAT TCACATTGGA TGGACAACGA GAAAATGTGC GCACACGAGA GACCAACTTA 
GGGAATTTGA TTGGTGATGC GATTATGTCA TATGGCCAAG ACGCGTTTAG CCAACCTGCT 
GATTTTGCAG TAACTAATGG TGGCGGCATT CGCGCTGATA TTAAACAAGG GCCAATTAAA 
GTTGGGGATG TCATTGCTGT GTTACCTTTT GGCAATAGCA TTGCGCAAAT TCAAGTAACC 
GGCGCCCAAG TTAAAGAAAT GTTTGAAATG TCTGTTCGTT CGATTCCACA AAAAGATGAG 
AATGGCACAA TTTTACTAGA TGATGCTGGC CAACGAAAAC TTGGCGCAAA TGGTGGTTTC 
CTACATGTTT CAAGCTCCAT TCGTATCCAC TATGATTCCA CAAAACCAGG TACTCGCTTG 
GCTAGTGACG AAGGCAATGA AACAGGACAA ACGATTGTCG GTAGTCGCGT ATTAGGAATA 
GAAATTAAAA ATCGGCAAAC ACAAAAGTTT GAACCATTGG ATGAGAAGAA ACAATACCGG 
ATGGCTACCA ATGATTTCTT AGCTGCTGGT GGTGATGGTT ACGATATGCT AGGTGGTGAA 
CGAGAAGAAG GGATTTCACT AGATTCTGTC TTAATTGAAT ACTTGAAAAG TGCAACCAGC 
TTGCGGTTGT ATCGTGCAGC AACGACGATT GATTTAGCAC AATATAAAGA ACCATTCCCA 
GGCGAACGAA TTGTTTCTAT TTCGGAAGAA GCTTACAAAG AGTTAATCGG TGGAGGAGAG 
ACGCCAAAAC CAGATCCAAA ACCAGACCCG AAACCAACAC CAGAAACACC AGTAGCAACC 
AATAAACAAA ACCAAGCGGG AGCAAGACAG AGCAATCCAT CCGTAACAGA GAAGAAAAAG 
TATGGCGGCT TT 

EF122-4 {SEQ ID NO:456) 

EKWRAKAI ELVNDGTLQI 

PTSPDGRTPN AAAITKQDVR NAGFDLDNAY TIMHTNDVHG RLEAGKGELG MARLKTFKDQ 
ENPTLMVDAG DVFQGLPISN FSKGADMAKA MNEVGYDAMA VGNHEFDFGL EIALGYKDQL 
NFPILSSNTY YKDGSGRVFD PYTIVEKSGK KFAIVGVTTP ETATKTHPKN VEKVTFKDPI 
PEVEAVIKEI KEKYADXQAF WTGHLGVDE TTPHIWRGDT LAETLSQTYP ELDITVIDGH 
SHTAVESGKR YGKVIYAQTG NYLNNVGIVT APESEPTKKT TKLISAAELL ELPENPAVKA 
IVDEARTNFN AENEKVIVDY IPFTLDGQRE NVRTRETNLG NLIGDAIMSY GQDAFSQPAD 
FAVTNGGGIR ADIKQGPIKV GDVIAVLPFG NSIAQIQVTG AQVKEMFEMS VRSIPQKDEN 
GTILLDDAGQ PKLGANGGFL HVSSSIRIHY DSTKPGTRLA SDEGNETGQT IVGSRVLGIE 
IKNRQTQKFE PLDEKKQYRM ATNDFLAAGG DGYDMLGGER EEGISLDSVL lEYLKSATSL 
RLYRAATTID LAQYKEPFPG ERIVSISEEA YKELIGGGET PKPDPKPDPK PTPETPVATN 
KQNQAGARQS NPSVTEKKKY GGF 



EF123-1 (SEQ ID NO:457) 

TAAAATAAAA AATTGGTACG AAGTGAACGT 
ATGAAAGAAA TGAGAAAGAA TGGTCCAATG 



TCTCTTCTAT GTGTCGTTAG TAGAGGAAGG 
GTAAACCGTT GGCTCTACGG GTTGATGTGT 
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TTGTTACTTG TTCTAAATTA TGGCACACCA CTCATGGCTT TGGCGGAAGA GGTTAACAGC 
GATGGCCAGT TAACGTTAGG AGAAGTGAAG CAAACCAGCC AGCAAGAAAT GACCTTAGCG 
CTTCAAGGAA AAGCACAACC AGTAACACAA GAGGTTGTAG TGCATTATAG TGCCAATGTG 
TCAATCAAAG CTGCACATTG GGCAGCGCCC AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTGAACC CTACAGCTAC AGAAGATGTG ACGTTTTCTT ATGGACAACA GCAACGAGCG 
TTGACGTTAA AGACTGGTAC TGATCCGACA GAATCAACGG CAATCACGAG TTCGCCAGCG 
GCATCAGCGA ATGAAGGTTC AACAGAAGAA GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA GCGATAGAAA GTAAAACAAC TGAATCGACG 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA GATATCAGTG ATTATTTTAC AGGTGATGAA 
ACAACGATTA TCGATAATTT TGAAGATCCG ATTTATTTAA ATCCTGATGG AACACCAGCA 
ACACCGCCGT ATAAAGAAGA TGTGACCATT CATTGGAACT TTAACTGGTC GATTCCAGAA 
GATGTGCGAG AACAAATGAA AGCAGGCGAT TACTTCGAGT TTCAATTACC TGGCAATTTG 
AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGATGCAG AAGGCAATGT CTATGGAACC 
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CCAGGAGATT GGGTGATTGA TATTCCTACA CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGCC ATTTTGATCG AACGCCCAAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAAGATCA -AACAAATCCA 
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
GTGATGAATC TTGATGGAAC AATTAAAGAA GTGGGTCGCG AACTTAGTCC AGATGAATAT 
ACCGTTGATA AAAATGGCAA TGTGACGATT AAAGGTGACA CCAACAAAGC GTATCGTCTT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT ATTCCAGATG GCGGCGGCGA TGTGCCTTTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT AATCCAAATG GGTTAGATGC TGAAGCAACT 
GTTACCGCCA CATATGGCAA AATGTTAGAC AAGCGCAATA TAGATTACGA CGAAGCCAAT 
CAAGAATTCA CTTGGGAAAT TAACTACAAC TATGGTGAAC AAACCATTCC AAAAGACCAA 
GCAGTCATTA CAGACACAAT GGGGGATAAT TTAACGTTTG AACCAGATTC TTTACATTTA 
TATTCAGTGA CATTTGATGA CAAAGGAAAT GAAGTCGTTG GAGCAGAACT TGTGGAAGGA 
AAAGATTACA AAGTGGTAAT CAACGGAGAC GGTTCCTTTG CAATTGACTT TTTACATGAT 
GTGACTGGCG CAGTCAAGAT TGATTATAAA ACCAAAGTTG ATGGAATTGT CGAAGGCGAT 
GTTGCCGTGA ATAATCGTGT GGATGTTGGC ACTGGTCAGC ATTCAGAAGA TGATGGCACA 
GCCAGTCAAC AAAATATTAT TAAAAACACT GGTGCAGTTG ATTATCAAAA TTCAACGATT 
GGTTGGACGT TAGCTGTGAA TCAAAATAAT TATTTGATGG AAAATGCCGT GATTACGGAT 
ACGTACGAAC CAGTTCCTGG CTTAACTATG GTACCCAATT CGTTGGTTGT CAAAGATACA 
ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATGCC 
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG 
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGGACGG ATGAAGCAGG AAACAATCAT 
CATTCAGAAG ATAGTAAACC GTTTAAACCT TTACCTGCTT TTGATTTAAA TGCGCAAAAA 
AGCGGTGTTT ACAATGCCGT CACCAAAGAA ATCACTTGGA CGATTGCGGT TAATTTAAGT 
AATAATCGTT TAGTCGACGC CTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC 
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTG ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GGACTAAAGG CAAAATTCAG 
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CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG 
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGGCGGATC GAACCAACGC GAATGGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA 
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG 
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG GGAAATTGTT GCAGAGCACT TAGCCCCAGG AAAATATCGC 
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCAGT CCCATTTGAA 
ATTGCTGAGA AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTCAGT 
TACAAAGGGG CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT 
GTTTTTGAAT TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTCGCAAA 
GATGGCAAAA TTATCTTTAG AGACTTGGCG CCAGGTACCT ATTATTACAA AGAAATCAAA 
GCACCAAAAT TACCAGATGG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA 
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC 
AAAGGACGCG CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT 
TTTAAATTGT ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA 
AAAGATGGTT CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG 
GATGCAACGG ATGGCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT 
TCAAATGATA AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG 
GGACGTAAAG TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT 
GCCGATGAGC AGAATCAGCC CCAAGGTTCA CCGATAACAT TCTTGAATCG TGCAGGAGAA 
AAAGTTTCTG AAATAACAAC GGATAAGACT GGCGAAATTT ACGCTAAAGG GCTAAATGAA 
GGGCATTACG TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA CACAACGCTA 
CATCCATTTG ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT AGGCGATCTT 
ATCAATTATC AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA 
GGTGCGGTGT TTAAGGTCAT TGATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG 
ATGTCTGACA AGCAAGGCAA AGTCATTGCG AAAAACTTAG CACCGGGAAC GTATCGTTTT 
GTGGAGACAC AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCGC AAGCTTTACG 
ATTGCCAAAG ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT 
TACCAAGGTG CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT 
GAATTTAAAG TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC 
AACCAAGGGT TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA 
AAAGCACCGG ATGGTTACCA ACTGTCTAAG CAAGCTGTCG CATTCACTAT TGCGGCAACA 
GCGAAAGACA AACCTGAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA 
TCCAAAAAAA CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG 
CTTGGTTTAC CGAAAACCAA CACACAAGTC AATTACTTCT TTGTCTTTAT CGGCCTCATG 
TTGGTCGGTT TGGCAAGTTG GCTCTTCTAT AAAAAGAGCA AGAAATAA 

EF123-2 (SEQ ID NO: 458) 

MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD 

GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE VWHYSANVS IKAAHWAAPN NTRKIQVDDQ 
KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA 
SANEGSTEEA STNSSVPRSS EETVASTTKA lESKTTESTT VKPRVAGPTD ISDYFTGDET 
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TIIDNFEDPI YLNPDGTPAT PPYKEDVTIH WNFNWSIPED VREQMKAGDY FEFQLPGNLK 
PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNERITSES DIHGDFSLDT HLNDSDGRGP 
GDWVIDIPTQ EDLPPWIPI VPDTEQQIDK QGHFDRTPNP SAITWTVDIN QAMKDQTNPT 
VTETWPTGNT FKSVKVYELV MNLDGTIKEV GRELSPDEYT VDKNGNVTIK GDTNKAYRLE 
YQTTIDEAVI PDGGGDVPFK NHATLTSDNN PNGLDAEATV TATYGKMLDK RNIDYDEANQ 
EFTWEINYNY GEQTIPKDQA VITDTMGDNL TFEPDSLHLY SVTFDDKGNE WGAELVEGK 
DYKWINGDG SFAIDFLHDV TGAVKIDYKT KVDGIVEGDV AVNNRVDVGT GQHSEDDGTA 
SQQNIIKNTG AVDYQNSTIG WTLAVNQNNY LMENAVITDT YEPVPGLTMV PNSLWKDTT 
TGAQLTLGKD FMVEITRNAD GETGFKVSF I GAYAKTSDAF HITYTTFFDV TELDANNPAL 
DHYRNTAAID WTDEAGNNHH SEDSKPFKPL PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN 
NRLVDAFLTD PILTNQTYLA GSLKVYEGNT KPDGSVEKVK PTQPLTDIIM EEPSEKNQNT 
WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS YDNTASYTNQ GSSRDVTGKV SIQHGGESVK 
KGGEYHKDDP DHVYWHVMIN GAQSVLDDW ITDTPSPNQV LDPESLVIYG TNVTEDGTIT 
PDKSVILEEG KDYTLEVTTD NETGQQKIW KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN 
QVSITGNGSE WHGDDNGDV WDIDHSGGH ATGTKGKIQL KKTAMDETTI LAGAHFQIWD 
QAKTQVLREG TVDATGVITF GGLPQGQYIL VETKAPEGYT VSDELAKGRV ITIDEETSAE 
GAQPTIIKND VNKVFLEKMD EKGKKLVNAR FKLEHAVTTP FTHWEEVPLA PDRTNANGQL 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF IVTQNTSGQI RDVHVKMLNY QGSAELIKKD 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN GKVTVTDLAP GKYQPVETKA PAGYLLNTEP 
SAFTIAASDR GKPATVIATA NFVNYQGTAK LIKKDVNGHL LSGATFKVLD AKGETIQTGL 
TTNNQGEIVA EHLAPGKYRF VETKAPTGYL LNTTPVPFEI AEKNAGKPAV WASDNFVSY 
KGAFQIVKTN SADQPLAGAV FELYDHNKQS LGITATSGKD GKIIFRDLAP GTYYYKEIKA 
PKLPDGADYI lYPELVKVEI RGDFKGDPEI FQLGAFANFK GRAVFKKIDA NANPLPGTIF 
KLYRIENGEK IFEREVTAEK DGSLAMEDLG AGSYELDELD ATDGYIVNKQ PIYFWKKNS 
NDKQPLDELE FVNYQAEVMG RKVNEQGQTL AGAVFAIYNA DEQNQPQGSP ITFLNRAGEK 
VSEITTDKTG EIYAKGLNEG HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALGDLI 
NYQGTAQLTK ENETGEALAG AVFKVIDETG QTVDGQTNLM SDKQGKVIAK NLiAPGTYRFV 
ETQAPTSYLL NETPSASFTI AKDNQGKPAT WLKAPFINY QGAAKLVKID QQKNALAGAE 
FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ PGKYTFVETK APDGYQLSKQ AVAFTIAATA 
KDKPELVNAG TFVNEKQPVS KKTKPNQPTT KQAARETGWL GLPKTNTQVN YFFVFIGLML 
VGLASWLFYK KSKK 

EF123-3 (SEQ ID NO:459)' 

GGAAGA GGTTAACAGC 

GATGGCCAGT TAACGTTAGG AGAAGTGAAG CAAACCAGCC AGCAAGAAAT GACCTTAGCG 
CTTCAAGGAA AAGCACAACC AGTAACACAA GAGGTTGTAG TGCATTATAG TGCCAATGTG 
TCAATCAAAG CTGCACATTG GGCAGCGCCC AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTGAACC CTACAGCTAC AGAAGATGTG ACGTTTTCTT ATGGACAACA GCAACGAGCG 
TTGACGTTAA AGACTGGTAC TGATCCGACA GAATCAACGG CAATCACGAG TTCGCCAGCC 
GCATCAGCGA ATGAAGGTTC AACAGAAGAA GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA GCGATAGAAA GTAAAACAAC TGAATCGACG 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA GATATCAGTG ATTATTTTAC AGGTGATGAA 
ACAACGATTA TCGATAATTT TGAAGATCCG ATTTATTTAA ATCCTGATGG AACACCAGCA 
ACACCGCCGT ATAAAGAAGA TGTGACCATT CATTGGAACT TTAACTCGTC GATTCCAGAA 
GATGTGCGAG AACAAATGAA AGCAGGCGAT TACTTCGAGT TTCAATTACC TGGCAATTTG 
AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGATGCAG AAGGCAATGT CTATGGAACC 
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CCAGGAGATT GGGTGATTGA TATTCCTACA CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGCC ATTTTGATCG AACGCCCAAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAAGATCA AACAAATCCA 
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
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GTGATGAATC 
ACCGTTGATA 
GAGTACCAAA 
AAAAATCACG 
GTTACCGCCA 
CAAGAATTCA 
GCAGTCATTA 
TATTCAGTGA 
AAAGATTACA 
GTGACTGGCG 
GTTGCCGTGA 
GCCAGTCAAC 
GGTTGGACGT 
ACGTACGAAC 
ACCACTGGTG 
GATGGTGAAA 
TTCCACATAA 
TTGGACCATT 



TTGATGGAAC 
AAAATGGCAA 
CGACGATTGA 
CGACGTTAAC 
CATATGGCAA 
CTTGGGAAAT 
CAGACACAAT 
CATTTGATGA 
AAGTGGTAAT 
CAGTCAAGAT 
ATAATCGTGT 
AAAATATTAT 
TAGCTGTGAA 
CAGTTCCTGG 
CTCAGTTGAC 
CAGGCTTTAA 
CTTATACTAC 
ATCGAAATAC 



AATTAAAGAA 
TGTGACGATT 
CGAGGCGGTT 
AAGTGATAAT 
AATGTTAGAC 
TAACTACAAC 
GGGGGATAAT 
CAAAGGAAAT 
CAACGGAGAC 
TGATTATAAA 
GGATGTTGGC 
TAAAAACACT 
TCAAAATAAT 
CTTAACTATG 
GTTAGGCAAG 
GGTAAGTTTT 
CTTTTTCGAT 
CGCTGCCATT 



GTGGGTCGCG 
AAAGGTGACA 
ATTCCAGATG 
AATCCAAATG 
AAGCGCAATA 
TATGGTGAAC 
TTAACGTTTG 
GAAGTCGTTG 
GGTTCCTTTG 
ACCAAAGTTG 
ACTGGTCAGC 
GGTGCAGTTG 
TATTTGATGG 
GTACCCAATT 
GATTTCATGG 
ATAGGGGCGT 
GTTACCGAGT 
GATTGG 



AACTTAGTCC 
CCAACAAAGC 
GCGGCGGGGA 
GGTTAGATGC 
TAGATTACGA 
AAACCATTCC 
AACCAGATTC 
GAGCAGAACT 
CAATTGACTT 
ATGGAATTGT 
ATTCAGAAGA 
ATTATCAAAA 
AAAATGCCGT 
CGTTGGTTGT 
TAGAAATAAC 
ATGCCAAAAC 
TAGACGCTAA 



AGATGAATAT 
GTATCGTCTT 
TGTGCCTTTT 
TGAAGCAACT 
CGAAGCCAAT 
AAAAGACCAA 
TTTACATTTA 
TGTGGAAGGA 
TTTACATGAT 
CGAAGGCGAT 
TGATGGCACA 
TTCAACGATT 
GATTACGGAT 
CAAAGATACA 
TCGTAATGCA 
AAGTGATGCC 
CAATCCTGCG 



EF123-4 (SEQ ID NO:460) 
EEVNSD 

GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE VWHYSANVS IKAAHWAAPN NTRKIQVDDQ 
KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA 
SANEGSTEEA STNSSVPRSS EETVASTTKA lESKTTESTT VKPRVAGPTD ISDYFTGDET 
TIIDNFEDPI YLNPDGTPAT PPYKEDVTIH WNFNWSIPED VREQMKAGDY FEFQLPGNLK 
PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNERITSES DIHGDFSLDT HLNDSDGRGP 
GDWVIDIPTQ EDLPPWIPI VPDTEQQIDK QGHFDRTPNP SAITWTVDIN QAMKDQTNPT 
VTETWPTGNT FKSVKVYELV MNLDGTIKEV GRELSPDEYT VDKNGNVTIK GDTNKAYRLE 
YQTTIDEAVI PDGGGDVPFK NHATLTSDNN PNGLDAEATV TATYGKMLDK RNIDYDEANQ 
EFTWEINYNY GEQTIPKDQA VITDTMGDNL TFEPDSLHLY SVTFDDKGNE WGAELVEGK 
DYKWINGDG SFAIDFLHDV TGAVKIDYKT KVDGIVEGDV AVNNRVDVGT GQHSEDDGTA 
SQQNIIKNTG AVDYQNSTIG WTLAVNQNNY LMENAVITDT YEPVPGLTMV PNSLWKDTT 
TGAQLTLGKD FMVEITRNAD GETGFKVSFI GAYAKTSDAF HITYTTFFDV TELDANNPAL 
DHYRNTAAID W 



EF124-1 (SEQ ID NO:461) 

TAAAATAAAA AATTGGTACG AAGTGAACGT 
ATGAAAGAAA TGAGAAAGAA TGGTCCAATG 
TTGTTACTTG TTCTAAATTA TGGCACACCA 
GATGGCCAGT TAACGTTAGG AGAAGTGAAG 
CTTCAAGGAA AAGCACAACC AGTAACACAA 
TCAATCAAAG CTGCACATTG GGCAGCGCCC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT 
ACGTTGAACC CTACAGCTAC AGAAGATGTG 
TTGACGTTAA AGACTGGTAC TGATCCGACA 
GCATCAGCGA ATGAAGGTTC AACAGAAGAA 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA 
ACAACGATTA TCGATAATTT TGAAGATCCG 
ACACCGCCGT ATAAAGAAGA TGTGACGATT 
GATGTGCGAG AACAAATGAA AGCAGGCGAT 



TCTCTTCTAT GTGTCGTTAG TAGAGGAAGG 
GTAAACCGTT GGCTCTACGG GTTGATGTGT 
CTCATGGCTT TGGCGGAAGA GGTTAACAGC 
CAAACCAGCC AGCAAGAAAT GACCTTAGCG 
GAGGTTGTAG TGCATTATAG TGCCAATGTG 
AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTTTCTT ATGGACAACA GCAACGAGCG 
GAATCAACGG CAATCACGAG TTCGCCAGCC 
GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
GCGATAGAAA GTAAAACAAC TGAATGGACG 
GATATCAGTG ATTATTTTAC AGGTGATGAA 
ATTTATTTAA ATCCTGATGG AACACCAGCA 
CATTGGAACT TTAACTGGTC GATTCCAGAA 
TACTTCGAGT TTCAATTACC TGGCAATTTG 
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AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGATGCAG AAGGCAATCT CTATGGAACC 
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CCAGGAGATT GGGTGATTGA TATTCCTACA CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGCC ATTTTGATCG AAG-GCCCAAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAAGATCA AACAAATCCA 
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
GTGATGAATC TTGATGGAAC AATTAAAGAA GTGGGTCGCG AACTTAGTCC AGATGAATAT 
ACCGTTGATA AAAATGGCAA TGTGACGATT AAAGGTGACA CCAACAAAGC GTATCGTCTT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT ATTCCAGATG GCGGCGGCGA TGTGCCTTTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT AATCCAAATG GGTTAGATGC TGAAGCAACT 
GTTACCGCCA CATATGGCAA AATGTTAGAC AAGCGCAATA TAGATTACGA CGAAGCCAAT 
CAAGAATTCA CTTGGGAAAT TAACTACAAC TATGGTGAAC AAACCATTCC AAAAGACCAA 
GCAGTCATTA CAGACACAAT GGGGGATAAT TTAACGTTTG AACCAGATTC TTTACATTTA 
TATTCAGTGA CATTTGATGA CAAAGGAAAT GAAGTCGTTG GAGCAGAACT TGTGGAAGGA 
AAAGATTACA AAGTGGTAAT CAACGGAGAC GGTTCCTTTG CAATTGACTT TTTACATGAT 
GTGACTGGCG CAGTCAAGAT TGATTATAAA ACCAAAGTTG ATGGAATTGT CGAAGGCGAT 
GTTGCCGTGA ATAATCGTGT GGATGTTGGC ACTGGTCAGC ATTCAGAAGA TGATGGCACA 
GCCAGTCAAC AAAATATTAT TAAAAACACT GGTGCAGTTG ATTATCAAAA TTCAACGATT 
GGTTGGACGT TAGCTGTGAA TCAAAATAAT TATTTGATGG AAAATGCCGT GATTACGGAT 
ACGTACGAAC CAGTTCCTGG CTTAACTATG GTACCCAATT GGTTGGTTGT CAAAGATACA 
ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGGGGCGT ATCCCAAAAC AAGTGATGCC 
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG 
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGGACGG ATGAAGCAGG AAACAATCAT 
CATTCAGAAG ATAGTAAACC GTTTAAACCT TTACCTGCTT TTGATTTAAA TGCGCAAAAA 
AGCGGTGTTT ACAATGCCGT CACCAAAGAA ATCACTTGGA CGATTGCGGT TAATTTAAGT 
AATAATCGTT TAGTCGACGC CTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGAGGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC 
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTG ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GGACTAAAGG CAAAATTCAG 
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG 
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAAGGC GAATGGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA 
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
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CCAAGTGCTT 
GCTAACTTTG 
TTATTAAGTG 
TTGACGACAA 
TTTGTAGAAA 
ATTGCTGAGA 
TACAAAGGGG 
GTTTTTGAAT 
GATGGCAAAA 
GCACCAAAAT 
ATTCGTGGTG 
AAAGGACGCG 
TTTAAATTGT 
AAAGAT6GTT 
GATGCAACGG 
TCAAATGATA 
GGACGTAAAG 
GCCGATGAGC 
AAAGTTTCTG 
GGGCATTACG 
CATCCATTTG 
ATCAATTATC 
GGTG.CGGTGT 
ATGTCTGACA 
GTGGAGACAC 
ATTGCCAAAG 
TACCAAGGTG 
GAATTTAAAG 
AACCAAGGGT 
AAAGCACCGG 
GCGAAAGACA 
TCCAAAAAAA 
CTTGGTTTAC 
TTGGTCGGTT 



TCACGATTGC 
TTAACTATCA 
GTGCGACATT 
ATAATCAAGG 
CCAAAGCGCC 
AAAATGCTGG 
CTTTCCAAAT 
TATATGATCA 
TTATCTTTAG 
TACCAGATGG 
ATTTCAAAGG 
CCGTCTTTAA 
ATCGAATCGA 
CATTGGCTAT 
ATGGCTATAT 
AACAACCACT 
TCAACGAGCA 
AGAATCAGCC 
AAATAACAAC 
TTTTAGTGGA 
ATGTAACCGC 
AAGGAACTGC 
TTAAGGTCAT 
AGCAAGGCAA 
AAGCGCCAAC 
ACAACCAAGG 
CTGCCAAGCT 
TGACAGATGC 
TAGTTCAAGT 
ATGGTTACCA 
AACCTGAACT 
CAAAACCAAA 
CGAAAACCAA 
TGGCAAGTTG 



AGCAAGCGAT 
AGGCACGGCT 
TAAAGTGCTT 
GGAAATTGTT 
AACAGGCTAT 
TAAACCAGCG 
CGTGAAAACG 
CAATAAACAA 
AGACTTGGCG 
CGCAGATTAT 
TGATCCGGAG 
GAAAATTGAT 
AAACGGGGAA 
GGAGGATTTA 
CGTCAATAAA 
AGATGAGTTA 
AGGTCAAACC 
CCAAGGTTCA 
GGATAAGACT 
AACGAAAGCA 
CCAATTAGGA 
TCAATTAACC 
TGATGAAACA 
AGTCATTGCG 
TAGCTATCTT 
CAAACCAGCC 
GGTGAAAATT 
AGAGACAGGG 
GAACCACTTA 
ACTGTCTAAG 
CGTGAATGCG 
TCAGCCAACA 
CACACAAGTC 
GCTCTTCTAT 



CGGGGCAAAC 
AAATTAATCA 
GATGCGAAGG 
GCAGAGCACT 
TTATTAAATA 
GTCGTGGTTG 
AATAGCGCAG 
TCATTAGGGA 
CCAGGTACCT 
ATTATTTATC 
ATTTTCCAAT 
GCCAATGCGA 
AAAATCTTTG 
GGTGCTGGTA 
CAACCCATTT 
GAGTTTGTAA 
TTAGCGGGTG 
CCGATAACAT 
GGCGAAATTT 
CCAACAGGCT 
AAAGAGCAGC 
AAAGAAAACG 
GGGCAAACCG 
AAAAACTTAG 
CTTAATGAAA 
ACTGTGGTAC 
GATCAGCAAA 
CAAACTGTCG 
CAACCAGGAA 
CAAGCTGTCG 
GGCACGTTTG 
ACGAAACAAG 
AATTACTTCT 
AAAAAGAGCA 



CAGCAACAGT 
AAAAAGATGT 
GAGAAACGAT 
TAGCCCCAGG 
CCACGCCAGT 
CTAGTGACAA 
ACCAACCATT 
TTACAGCAAC 
ATTATTACAA 
CTGAATTAGT 
TAGGGGCCTT 
ACCCACTTCC 
AAAGAGAAGT 
GCTATGAATT 
ATTTTGTAGT 
ATTATCAAGC 
CAGTTTTTGC 
TCTTGAATCG 
ACGCTAAAGG 
ATCTGTTAGA 
CAATTGCTTT 
AAACAGGTGA 
TAGATGGACA 
CACCGGGAAC 
CGCCAAGCGC 
TTAAAGCACC 
AGAATGCCTT 
CTCGTTCATT 
AATATACCTT 
CATTCACTAT 
TTAACGAGAA 
CAGCTAGAGA 
TTGTCTTTAT 
AGAAATAA 



TATAGCAACG 
GAATGGACAC 
TCAAACAGGC 
AAAATATCGC 
CCCATTTGAA 
CTTTGTGAGT 
AGCAGGTGCT 
GAGTGGCAAA 
AGAAATCAAA 
AAAAGTAGAA 
CGCCAATTTC 
AGGAACGATT 
AACTGCTGAA 
AGATGAACTG 
GAAGAAGAAT 
AGAAGTAATG 
AATTTACAAT 
TGCAGGAGAA 
GCTAAATGAA 
CACAACGCTA 
AGGCGATCTT 
AGCATTGGCA 
AACCAATCTG 
GTATCGTTTT 
AAGCTTTACG 
TTTTATTAAT 
AGCAGGTGCT 
ACGTTCTGAC 
TGTGGAAACA 
TGCGGCAACA 
ACAACCTGTA 
GACAGGTTGG 
CGGCCTCATG 



EF124-2 (SEQ ID NO:462) 

MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD 

GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE WVHYSMfPJS IKAAHWAAPN NTRKIQVDDQ 
KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA 
SANEGSTEEA STNSSVPRSS EETVASTTKA lESKTTESTT VKPRVAGPTD ISDYFTGDET 
TIIDNFEDPI YLNPDGTPAT PPYKEDVTIH WNFNWSIPED VREQMKAGDY FEFQLPGNLK 
PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNERITSES DIHGDFSLDT HLNDSDGRGP 
GDWVIDIPTQ EDLPPWIPI VPDTEQQIDK QGHFDRTPNP SAITWTVDIN QAMKDQTNPT 
VTETWPTGNT FKSVKVYELV MNLDGTIKEV GRELSPDEYT VDKNGNVTIK GDTNKAYRLE 
YQTTIDEAVI PDGGGDVPFK NHATLTSDNN PNGLDAEATV TATYGKiaLDK RNIDYDEANQ 
EFTWEINYNY GEQTIPKDQA VITDTMGDNL TFEPDSLHLY SVTFDDKGNE WGAELVEGK 
DYKWINGDG SFAIDFLHDV TGAVKIDYKT KVDGIVBGDV AVNNRVDVGT GQHSEDDGTA 
SQQNIIKNTG AVDYQNSTIG WTLAVNQNNY LMENAVITDT YEPVPGLTMV PNSLWKDTT 
TGAQLTLGKD FMVEITRNAD GETGFKVSFI GAYAKTSDAF HITYTTFFDV TELDANNPAL 
DHYRNTAAID WTDEAGNNHH SEDSKPFKPL PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN 
NRLVDAFLTD PILTNQTYLA GSLKVYEGNT KPDGSVEKVK PTQPLTDITM EEPSEKNQNT 
WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS YDNTASYTNQ GSSRDVTGKV SIQHGGESVK 
KGGEYHKDDP DHVYWHVMIN GAQSVLDDW ITDTPSPNQV LDPESLVIYG TNVTEDGTIT 
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PDKSVILEEG KDYTLEVTTD NETGQQKIW KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN 
QVSITGNGSE WHGDDNGDV WDIDHSGGH ATGTKGKIQL KKTAMDETTI LAGAHFQIWD 
QAKTQVLREG TVDATGVITF GGLPQGQYIL VETKAPEGYT VSDELAKGRV ITIDEETSAE 
GAQPTIIKND VNKVFLEKMD EKGKKLVNAR FKLEHAVTTP FTHWEEVPLA PDRTNANGQL 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF IVTQNTSGQI RDVHVKMLNY QGSAELIKKD 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN GKVTVTDLAP GKYQFVETKA PAGYLLNTEP 
SAFTIAASDR GKPATVIATA NFVNYQGTAK LIKKDVNGHL LSGATFKVLD AKGETIQTGL 
TTONQGEIVA EHLAPGKYRF VETKAPTGYL L^rTTPVPFEI AEKNAGKPAV WASDNFVSY 
KGAFQIVKTN SADQPLAGAV FELYDHNKQS LGITATSGKD GKIIFRDLAP GTYYYKEIKA 
PKLPDGADYI lYPELVKVEI RGDFKGDPEI FQLGAFANFK GRAVFKKIDA NANPLPGTIF 
KLYRIENGEK IFEREVTAEK DGSLAMEDLG AGSYELDELD ATDGYIVNKQ PIYFWKKNS 
NDKQPLDELE FVNYQAEVMG RKVNEQGQTL AGAVFAIYNA DEQNQPQGSP ITFLNRAGEK 
VSEITTDKTG EIYAKGLNEG HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALGDLI 
NYQGTAQLTK ENETGEALAG AVFKVIDETG QTVDGQTNLM SDKQGKVIAK NLAPGTYRFV 
ETQAPTSYLL NETPSASFTI AKDNQGKPAT WLKAPFINY QGAAKLVKID QQKNALAGAE 
FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ PGKYTFVETK APIDGYQLSKQ AVAFTIAATA 
KDKPELVNAG TFVNEKQPVS KKTKPNQPTT KQAARETGWL GLPKTNTQVN YFFVFIGLML 
VGLASWLFYK KSKK 

EF124-3 (SEQ ID NO: 463) 

TGCCTTCCACATAACTTATACTACCTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGITATGACA ATACCGCATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC 
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTC ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GGACTAAAGG CAAAATTCAG 
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG 
GACCAAGCTA AAACACAAGT CCTAGGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC GAATGGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA 
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG 
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG G 



EF124-4 (SEQ ID NO:464) 
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AF HITYTTFFDV TELDANNPAL 
DHYRNTAAID WTDEAGNNHH SEDSKPFKPL 
NRLVDAFLTD PILTNQTYLA GSLKVYEGNT 
WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS 
KGGEYHKDDP DHVYWHVMIN GAQSVLDDW 
PDKSVILEEG KDYTLEVTTD NETGQQKIW 
QVSITGNGSE WHGDDNGDV WDIDHSGGH 
QAKTQVLREG TVDATGVITF GGLPQGQYIL 
GAQPTIIKND VNKVFLEKMD EKGKKLVNAR 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN 
SAFTIAASDR GKPATVIATA NFVNYQGTAK 
TTNNQG 



PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN 
KPDGSVEKVK PTQPLTDITM EEPSEKNQNT 
YDNTASYTNQ GSSRDVTGKV SIQHGGESVK 
ITDTPSPNQV LDPESLVIYG TNVTEDGTIT 
KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN 
ATGTKGKIQL KKTAMDETTI LAGAHFQIWD 
VETKAPEGYT VSDELAKGRV ITIDEETSAE 
FKLEHAVTTP FTHWEEVPLA PDRTNANGQL 
IVTQNTSGQI RDVHVKMLNY QGSAELIKKD 
GKVTVTDLAP GKYQFVETKA PAGYLLNTEP 
LIKKDVNGHL LSGATFKVLD AKGETIQTGL 



EF125-1 (SEQ ID NO:465) 

TAAAATAAAA AATTGGTACG AAGTGAACGT 
ATGAAAGAAA TGAGAAAGAA TGGTCCAATG 
TTGTTACTTG TTCTAAATTA TGGCACACCA 
GATGGCCAGT TAACGTTAGG AGAAGTGAAG 
CTTCAAGGAA AAGCACAACC AGTAACACAA 
TCAATCAAAG CTGCACATTG GGCAGCGCCC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT 
ACGTTGAACC CTACAGCTAC AGAAGATGTG 
TTGACGTTAA AGACTGGTAC TGATCCGACA 
GCATCAGCGA ATGAAGGTTC AACAGAAGAA 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA 
ACAACGATTA TCGATAATTT TGAAGATCCG 
ACACCGCCGT ATAAAGAAGA TGTGACCATT 
GATGTGCGAG AACAAATGAA AGCAGGCGAT 
AAACCTAATA AACCAGGTTC AGGTGATTTA 
TACACAATTA GTGAAGATGG TACGGTTCGT 
AGTGACATTC ACGGGGACTT TTCTTTAGAT 
CCAGGAGATT GGGTGATTGA TATTCCTACA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC 
ACTGTGACGG AAACATGGCC AACAGGGAAT 
GTGATGAATC TTGATGGAAC AATTAAAGAA 
ACCGTTGATA AAAATGGCAA TGTGACGATT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT 
GTTACCGCCA CATATGGCAA AATGTTAGAC 
CAAGAATTCA CTTGGGAAAT TAACTACAAC 
GCAGTCATTA CAGACACAAT GGGGGATAAT 
TATTCAGTGA CATTTGATGA CAAAGGAAAT 
AAAGATTACA AAGTGGTAAT CAACGGAGAC 
GTGACTGGCG CAGTCAAGAT TGATTATAAA 
GTTGCCGTGA ATAATCGTGT GGATGTTGGC 
GCCAGTCAAC AAAATATTAT TAAAAACACT 
GGTTGGACGT TAGCTGTGAA TCAAAATAAT 
ACGTACGAAC CAGTTCCTGG CTTAACTATG 



TCTCTTCTAT GTGTCGTTAG TAGAGGAAGG 
GTAAACCGTT GGCTCTACGG GTTGATGTGT 
CTCATGGCTT TGGCGGAAGA GGTTAACAGC 
CAAACCAGCC AGCAAGAAAT GACCTTAGCG 
GAGGTTGTAG TGCATTATAG TGCCAATGTG 
AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTTTCTT ATGGACAACA GCAACGAGCG 
GAATCAACGG CAATCACGAG TTCGCCAGCC 
GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
GCGATAGAAA GTAAAACAAC TGAATGGACG 
GATATCAGTG ATTATTTTAC AGGTGATGAA 
ATTTATTTAA ATCCTGATGG AACACCAGCA 
CATTGGAACT TTAACTGGTC GATTCCAGAA 
TACTTCGAGT TTCAATTACC TGGCAATTTG 
GTTGATGCAG AAGGCAATGT CTATGGAACC 
TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
AAACAAGGCC ATTTTGATCG AACGCCCAAT 
AATCAAGCGA TGAAAGATCA AACAAATCCA 
ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
GTGGGTCGCG AACTTAGTCC AGATGAATAT 
AAAGGTGACA CCAACAAAGC GTATCGTCTT 
ATTCCAGATG GCGGCGGCGA TGTGCCTTTT 
AATCCAAATG GGTTAGATGC TGAAGCAACT 
AAGCGCAATA TAGATTACGA CGAAGCCAAT 
TATGGTGAAC AAACCATTCC AAAAGACCAA 
TTAACGTTTG AACCAGATTC TTTACATTTA 
GAAGTCGTTG GAGCAGAACT TGTGGAAGGA 
GGTTCCTTTG CAATTGACTT TTTACATGAT 
ACCAAAGTTG ATGGAATTGT CGAAGGCGAT 
ACTGGTCAGC ATTCAGAAGA TGATGGCACA 
GGTGCAGTTG ATTATCAAAA TTCAACGATT 
TATTTGATGG AAAATGCCGT GATTACGGAT 
GTACCCAATT CGTOXSGTTGT CAAAGATACA 
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ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATGCC 
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG 
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGGACGG ATGAAGCAGG AAACAATCAT 
CATTCAGAAG ATAGTAAACC GTTTAAACCT TTACCTGCTT TTGATTTAAA TGCGCAAAAA 
AGCGGTGTTT ACAATGCCGT CACCAAAGAA ATCACTTGGA CGATTGCGGT TAATTTAAGT 
AATAATCGTT TAGTCGACGC CTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTC 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC 
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTC ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GGACTAAAGG CAAAATTCAG 
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG 
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC GAATGGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA 
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG 
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG GGAAATTGTT GCAGAGCACT TAGCCCCAGG AAAATATCGC 
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCAGT CCCATTTGAA 
ATTGCTGAGA AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTGAGT 
TACAAAGGGG CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT 
GTTTTTGAAT TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTGGCAAA 
GATGGCAAAA TTATCTTTAG AGACTTGGCG CCAGGTACCT ATTATTACAA AGAAATCAAA 
GCACCAAAAT TACCAGATGG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA 
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC 
AAAGGACGCG CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT 
TTTAAATTGT ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA 
AAAGATGGTT CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG 
GATGCAACGG ATGGCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT 
TCAAATGATA AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG 
GGACGTAAAG TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT 
GCCGATGAGC AGAATCAGCC CCAAGGTTCA CCGATAACAT TCTTGAATCG TGCAGGAGAA 
AAAGTTTCTG AAATAACAAC GGATAAGACT GGCGAAATTT ACGCTAAAGG -GCTAAATGAA 
GGGCATTACG TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA -CACAACGCTA 
CATCCATTTG ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT AGGCGATCTT 
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ATCAATTATC AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA 
GGTGCGGTGT TTAAGGTCAT TGATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG 
ATGTCTGACA AGCAAGGCAA AGTCATTGCG AAAAACTTAG CACCGGGAAC GTATCGTTTT 
GTGGAGACAC AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCGC AAGCTTTACG 
ATTGCCAAAG ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT 
TACCAAGGTG CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT 
GAATTTAAAG TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC 
AACCAAGGGT TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA 
AAAGCACCGG ATGGTTACCA ACTGTCTAAG CAAGCTGTCG CATTCACTAT TGCGGCAACA 
GGGAAAGACA AACCTGAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA 
TCCAAAAAAA CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG 
CTTGGTTTAC CGAAAACCAA CACACAAGTC AATTACTTCT TTGTCTTTAT CGGCCTCATG 
TTGGTCGGTT TGGCAAGTTG GCTCTTCTAT AAAAAGAGCA AGAAATAA 



EF125-2 (SEQ ID NO:466) 



MRKNGPMV NRWLYGLMCL LLVLNYGTPL Mi 
GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE 
KKQIQIELNQ QALADTLVLT LNPTATEDVT 
SANEGSTEEA STNSSVPRSS EETVASTTKA 
TIIDNFEDPI YLNPDGTPAT PPYKEDVTIH 
PNKPGSGDLV DAEGNVYGTY TISEDGTVRF 
GDWVIDIPTQ EDLPPWIPI VPDTEQQIDK 
VTETWPTGNT FKSVKVYELV MNLDGTIKEV 
YQTTIDEAVI PDGGGDVPFK NHATLTSDNN 
EFTWEINYNY GEQTIPKDQA VITDTMGDNL 
DYKWINGDG SFAIDFLHDV TGAVKIDYKT 
SQQNIIKNTG AVDYQNSTIG WTLAVNQNNY 
TGAQLTLGKD FMVEITRNAD GETGFKVSFI 
DHYRNTAAID WTDEAGNNHH SEDSKPFKPL 
NRLVDAFLTD PILTNQTYLA GSLKVYEGNT 
WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS 
KGGEYHKDDP DHVYWHVMIN GAQSVLDDW 
PDKSVILEEG KDYTLEVTTD NETGQQKIW 
QVSITGNGSE WHGDDNGDV WDIDHSGGH 
QAKTQVLREG WDATGVITF GGLPQGQYIL 
GAQPTIIKND VNKVFLEKMD EKGKKLVNAR 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN 
SAFTIAASDR GKPATVIATA NFVNYQGTAK 
TTNNQGEIVA EHLAPGKYRF VETKAPTGYL 
KGAFQIVKTN SADQPLAGAV FELYDHNKQS 
PKLPDGADYI lYPELVKVEI RGDFKGDPEI 
KLYRIENGEK IFEREVTAEK DGSLAMEDLG 
NDKQPLDELE FVNYQAEVMG RKVNEQGQTL 
VSEITTDKTG EIYAKGLNEG HYVLVETKAP 
NYQGTAQLTK ENETGEALAG AVFKVIDETG 
ETQAPTSYLL NETPSASFTI AKDNQGKPAT 
FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ 
KDKPELVNAG TFVNEKQPVS KKTKPNQPTT 
VGLASWLFYK KSKK 



VWHYSANVS IKAAHWAAPN NTRKIQVDDQ 
FSYGQQQRAL TLKTGTDPTE STAITSSPAA 
lESKTTESTT VKPRVAGPTD ISDYFTGDET 
WNFNWSIPED VREQMKAGDY FEFQLPGNLK 
TFNERITSES DIHGDFSLDT HLNDSDGRGP 
QGHFDRTPNP SAITWTVDIN QAMKDQTNPT 
GRELSPDEYT VDKNGNVTIK GDTNKAYRLE 
PNGLDAEAW TATYGKMLDK RNIDYDEANQ 
TFEPDSLHLY SVTFDDKGNE WGAELVEGK 
KVDGIVEGDV AVNNRVDVGT GQHSEDDGTA 
LMENAVITDT YEPVPGLTMV PNSLWKDTT 
GAYAKTSDAF HITYTTFFDV TELDANNPAL 
PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN 
KPDGSVEKVK PTQPLTDITM EEPSEKNQISfT 
YDNTASYTNQ GSSRDVTGKV SIQHGGESVK 
ITDTPSPNQV LDPESLVIYG TNVTEDGTIT 
KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN 
ATGTKGKIQL KKTAMDETTI LAGAHFQIWD 
VETKAPEGYT VSDELAKGRV ITIDEETSAE 
FKLEHAVTTP FTHWEEVPLA PDRTNANGQL 
IVTQNTSGQI RDVHVKMLNY QGSAELIKKD 
GKVTVTDLAP GKYQFVETKA PAGYLLNTEP 
LIKKDVNGHL LSGATFKVLD AKGETIQTGL 
LNTTPVPFEI AEKNAGKPAV WASDNFVSY 
LGITATSGKD GKIIFRDLAP GTYYYKEIKA 
FQLGAFANFK GRAVFKKIDA NANPLPGTIF 
AGSYELDELD ATDGYIVNKQ PIYFWKKNS 
AGAVFAIYNA DEQNQPQGSP ITFLNRAGEK 
TGYLLDTTLH PFDVTAQIjGK EQPIALGDLI 
QTVDGQTNLM SDKQGKVIAK NLAPGTYRFV 
WLKAPFINY QGAAKLVKID QQKNALAGAE 
PGKYTFVETK APDGYQLSKQ AVAFTIAATA 
KQAARETGWL GLPKTNTQVN YFFVFIGLML 



EF125-3 (SEQ ID NO: 467) 
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TAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG GGAAATTGTT GCAGAGCACT TAGCCCCAGG AAAATATCGC 
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCAGT CCCATTTGAA 
ATTGCTGAGA AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTGAGT 
TACAAAGGGG CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT 
GTTTTTGAAT TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTGGCAAA 
GATGGCAAAA TTATCTTTAG AGACTTGGCG CCAGGTACCT ATTATTACAA AGAAATCAAA 
GCACCAAAAT TACCAGATGG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA 
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC 
AAAGGACGCG CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT 
TTTAAATTGT ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA 
AAAGATGGTT CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG 
GATGCAACGG ATGGCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT 
TCAAATGATA AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG 
GGACGTAAAG TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT 
GCCGATGAGC AGAATCAGCC CCAAGGTTCA CCGATAACAT TCTTGAATCG TGCAGGAGAA 
AAAGTTTCTG AAATAACAAC GGATAAGACT GGCGAAATTT ACGCTAAAGG GCTAAATGAA 
GGGCATTACG TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA CACAACGCTA 
CATCCATTTG ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT AGGCGATCTT 
ATCAATTATC AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA 
GGTGCGGTGT TTAAGGTCAT TGATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG 
ATGTCTGACA AGCAAGGCAA AGTCATTGCG AAAAACTTAG CACCGGGAAC GTATCGTTTT 
GTGGAGACAC AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCGC AAGCTTTACG 
ATTGCCAAAG ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT 
TACCAAGGTG CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT 
GAATTTAAAG TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC 
AACCAAGGGT TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA 
AAAGCACCGG ATGGTTACCA ACTGTCTAAG CAAGCTGTCG CATTCACTAT TGCGGCAACA 
GCGAAAGACA AACCTGAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA 
TCCAAAAAAA CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG 
CTTGGT 

EF125-4 (SEQ ID NO:468) 

NFVNYQGTAK LIKKDVNGHL LSGATFKVLD AKGETIQTGL 

TTNNQGEIVA EHLAPGKYRF VETKAPTGYL LNTTPVPFEI AEKNAGKPAV WASDNFVSY 
KGAFQIVKTN SADQPLAGAV FELYDHNKQS LGITATSGKD GKIIFRDLAP GTYYYKEIKA 
PKLPDGADYI lYPELVKVEI RGDFKGDPEI FQLGAFANFK GRAVFKKIDA NANPLPGTIF 
KLYRIENGEK IFEREVTAEK DGSLAMEDLG AGSYELDELD ATDGYIVNKQ PIYFWKKNS 
NDKQPLDELE FVNYQAEVMG RKVNEQGQTL AGAVFAIYNA DEQNQPQGSP ITFLNRAGEK 
VSEITTDKTG EIYAKGLNEG HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALGDLI 
NYQGTAQLTK ENETGEALAG AVFKVIDETG QTVDGQTNLM SDKQGKVIAK NLAPGTYRFV 
ETQAPTSYLL NETPSASFTI AKDNQGKPAT WIiKAPFINY QGAAKLVKID QQKNALAGAE 
FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ PGKYTFVETK APDGYQLSKQ AVAFTIAATA 
KDKPEIiVNAG TFVNEKQPVS KKTKPNQPTT KQAARETGWLG 

EF126-1 {SEQ ID NO:469) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATT6TAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
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TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA. AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF126-2 (SEQ ID NO:470) 

MF KKATKLLS™ VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVS'SGTMNQG TIAKEFPEAT 
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IPKNDNAHAC DVTPEDPTIT KDIENQEHLD 
DINKVLDIID VKVTDENGKD VTANGTVTQE 
IKTDATDEEL APYIEQGGIP NQADLNFGNE 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG 
TITTKIKTDA TDEELAPYIE QGGIPNQADL 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS 
GITKNKKRKN 



LTNREDSFDW HVKTAFGNET STWTQASMVD 
NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
KIKASATDEE LAPYIEQGGI PNQADLNFGN 
QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 



EF126-3 (SEQ ID NO:471) 
TGAA 

GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGAT 

EF126-4 (SEQ ID NO:472) 

EE AVKAGDTEGM TNTVKVKDDS 

LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTTAID 

EF127-1 (SEQ ID NO: 473) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
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AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC -GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGGGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACT6ATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF127-2 (SEQ ID NO:474) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
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DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 

ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 

EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 

ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 

TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 

PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 



EF127-3 (SEQ ID NO:475) 

GAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 

ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAAT 



EF127-4 (SEQ ID NO:476) 
NQG TIAKEFPEAT 

IPKNDNAHAC DVTPEDPTIT KDIENQEHLD 
DINKVLDIID VKVTDENGKD VTANGTVTQE 
IKTDATDEEL APYIEQGGIP NQADLNFGNE 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV 

EF128-1 (SEQ ID NO: 477) 



LTNREDSFDW HVKTAFGNET STWTQASMVD 
NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DDIN 



TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGAC<5ATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
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TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACGGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGOXXSTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCAITTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF128-2 (SEQ ID NO:478) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKtiALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTiyiTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 



EF128-3 (SEQ ID NO:479) 
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AGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 

CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT 6CACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCAT 



EF128-4 (SEQ ID NO: 480) 
DENGK DVTANGKVTQ 

ENNKVTFEMN XQADSYDYLS GHTYTMTITT 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG 
TITTKIKTDA TDEELAPYIE QGGIPNQADL 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS 



KIKASATDEE LAPYIEQGGI PNQADLNFGN 
QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
KGIH 



EF129-1 (SEQ ID NO:481) 

TGACAAGTGA AGAAACGTCT ATTTGCATCA 
ATTGCTACCC CAAGCATCGC TTTGGCGGAC 
CAAGAAATTT CATCATTAAA AGCAAAACAA 
GAAGCAGAAG TATCTTCAGT ATTTGATGAA 
CTAAAAGCAA AATCAGAACA ATTACAACAA 
AAACGTAACG AAGCAATCAA AAATCAAGCA 
ACAATGCTAG ATGCAGTTTT AGATGCGGAC 
GCTGTTTCAA CAATCGTAAG TGCCAACAAC 
CAAGCCGTTG TTGATAAAAA AGCTGAAAAC 
GAAGCTGAAT TAGAAACAAA ACGTCAAGAT 
ATGAAAGCTT CATTAGCATT AGAACAATCA 
AAACAAAAAG CAGCTGCTGA AGCAGAGCAA 
GCTGAAAAAG CCAAACAAGC TGCTGCAAAA 
CCAGTTGCCT CTTCATCAAC AACAGAAGCA 
GAATCAAGCA CGCAACAAAC AACTGAAACA 
GAAAATACTG GCTCTTCTTC ATCAGAACAA 
GGAAATAATG GTGGCCAAAC TGGTGGTGGA 
GCGCCTTCTG CTGATCCAAC AATCAATGCA 
CGTCCAGTAG TATGGGATGC AGGTTTGGCA 
GAAGCAGGTG GCATTCCAAA TGATCACTGG 
TGGGCGCCAG GTAACTCAGT AATCATGGCG 
TCAGGAAGCG GTCACCGTGA TTGGGAAATT 
TACTCAGGTA GCACAATCGT AGGACACTCA 



GTATTACTAT GTTCATTAAC GCTATCAGCA 
AATGTTGATA AAAAAATTGA AGAAAAAAAT 
OGGGATTTAG CTTCACAAGT ATCTTCTTTA 
AGCATGGCTT TACGTGAACA AAAGCAAACA 
GAAATTACAA ACTTGAATCA ACGTATTGAA 
CGTGATGTTC AAGTTAATGG ACAAAGCACA 
TCAGTTGCAG ATGCAATCAG CCGTGTTCAA 
GACTTAATGC AACAACAAAA AGAAGACAAA 
GAGAAAAAAG TGAAACAACT TGAAGCAACA 
TTACTTTCTA AACAATCTGA ATTAAAGGTA 
TCAGCTGAAA GTTCTAAAGC TGGCTTAGAA 
GCACGCTTAG CTGCTGAACA AAAAGCTGCA 
CCAGCTAAAG CTGAAGTGAA AGCAGAAGCA 
CAAGCACCAG CAAGCTCAAG CTCAGCAACT 
ACTACACCAA GTACAGATAA TAGTGCAACA 
CCAGTACAAC CTACAACACC AAGCGATAAT 
ACAGTTACAC CAACACCAGA ACCAACACCA 
TTGAACGTTC TACGTCAATC ATTAGGTTTA 
GCTTCTGCAA CTGCTCGTGC AGCACAAGTT 
TCTCGTGGAG ATGAAGTTAT CGCAATTATG 
TGGTACAATG AAACAAACAT GGTAACAGCT 
AACCCAGGTA TTACGCGTGT CGGTTTTGGT 
GCCTAA 



EF129-2 (SEQ ID NO: 482) 
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VKKRLFASV LLCSLTLSAI ATPSIALADN VDKKIEEKNQ EISSLKAKQG DLASQVSSLE 
AEVSSVFDES MALREQKQTL KAKSEQLQQE ITNLNQRIEK RNEAIKNQAR DVQVNGQSTT 
MLDAVLDADS VADAISRVQA VSTIVSANND LMQQQKEDKQ AWDKKAENE KKVKQLEATE 
AELETKRQDL LSKQSELNVM KASLALEQSS AESSKAGLEK QKAAAEAEQA RLAAEQKAAA 
EKAKQAAAKP AKAEVKAEAP VASSSTTEAQ APASSSSATE SSTQQTTETT TPSTDNSATE 
NTGSSSSEQP VQPTTPSDNG NNGGQTGGGT VTPTPEPTPA PSADPTINAL NVLRQSLGLR 
PWWDAGLAA SATARAAQVE AGGIPNDHWS RGDEVIAIMW APGNSVIMAW YNETNMVTAS 
GSGHRDWEIN PGITRVGFGY SGSTIVGHSA 

EF129-3 (SEQ ID NO:483) 



GGAC AATGTTGATA AAAAAi 
CAAGAAATTT CATCATTAAA 
GAAGCAGAAG TATCTTCAGT 
CTAAAAGCAA AATCAGAACA 
AAACGTAACG AAGCAATCAA 
ACAATGCTAG ATGCAGTTTT 
GCTGTTTCAA CAATCGTAAG 
CAAGCCGTTG TTGATAAAAA 
GAAGCTGAAT TAGAAACAAA 
ATGAAAGCTT CATTAGCATT 
AAACAAAAAG CAGCTGCTGA 
GCTGAAAAAG CCAAACAAGC 
CCAGTTGCCT CTTCATCAAC 
GAATCAAGCA CGCAACAAAC 
GAAAATACTG GCTCTTCTTC 
GGAAATAATG GTGGCCAAAC 
GCGCCTTCTG CTGATCCAAC 
CGTCCAGTAG TATGGGATGC 
GAAGCAGGTG GCATTCCAAA 
TGGGCGCCAG GTAACTCAGT 
TCAGGAAGCG GTCACCGTGA 
TACTCAGGTA GCACAATCGT 



lTTGA AGAAAAAAAT 
AGCAAAACAA GGGGATTTAG 
ATTTGATGAA AGCATGGCTT 
ATTACAACAA GAAATTACAA 
AAATCAAGCA CGTGATGTTC 
AGATGCGGAC TCAGTTGCAG 
TGCCAACAAC GACTTAATGC 
AGCTGAAAAC GAGAAAAAAG 
ACGTCAAGAT TTACTTTCTA 
AGAACAATCA TCAGCTGAAA 
AGCAGAGCAA GCACGCTTAG 
TGCTGCAAAA CCAGCTAAAG 
AACAGAAGCA CAAGCACCAG 
AACTGAAACA ACTACACCAA 
ATCAGAACAA CCAGTACAAC 
TGGTGGTGGA ACAGTTACAC 
AATCAATGCA TTGAACGTTC 
AGGTTTGGCA GCTTCTGCAA 
TGATCACTGG TCTCGTGGAG 
AATCATGGCG TGGTACAATG 
TTGGGAAATT AACCCAGGTA 
AGGACACTCA GCC 



CTTCACAAGT ATCTTCTTTA 
TACGTGAACA AAAGCAAACA 
ACTTGAATCA ACGTATTGAA 
AAGTTAATGG ACAAAGCACA 
ATGCAATCAG CCGTGTTCAA 
AACAACAAAA AGAAGACAAA 
TGAAACAACT TGAAGCAACA 
AACAATCTGA ATTAAACGTA 
GTTCTAAAGC TGGCTTAGAA 
CTGCTGAACA AAAAGCTGCA 
CTGAAGTGAA AGCAGAAGCA 
CAAGCTCAAG CTCAGCAACT 
GTACAGATAA TAGTGCAACA 
CTACAACACC AAGCGATAAT 
CAACACCAGA ACCAACACCA 
TACGTCAATC ATTAGGTTTA 
CTGCTCGTGC AGCACAAGTT 
ATGAAGTTAT CGCAATTATG 
AAACAAACAT GGTAACAGCT 
TTACGCGTGT CGGTTTTGGT 



EF129-4 (SEQ ID NO:484) 

DN VDKKIEEKNQ EISSLKAKQG DLASQVSSLE 

AEVSSVFDES MALREQKQTL KAKSEQLQQE ITNLNQRIEK RNEAIKNQAR DVQVNGQSTT 
MLDAVLDADS VADAISRVQA VSTIVSANND LMQQQKEDKQ AWDKKAENE KKVKQLEATE 
AELETKRQDL LSKQSELNVM KASLALEQSS AESSKAGLEK QKAAAEAEQA RLAAEQKAAA 
EKAKQAAAKP AKAEVKAEAP VASSSTTEAQ APASSSSATE SSTQQTTETT TPSTDNSATE 
NTGSSSSEQP VQPTTPSDNG NNGGQTGGGT VTPTPEPTPA PSADPTINAL NVLRQSLGLR 
PWWDAGLAA SATARAAQVE AGGIPNDHWS RGDEVIAIMW APGNSVIMAW YNETNMVTAS 
GSGHRDWEIN PGITRVGFGY SGSTIVGHSA 

EF130-1 (SEQ ID NO:485) 

TGATACATTA AAAGGAGGGA AAATATGCGC CCAAAAGAGA AAAAAAGAGG AAAAAATTGG 
TTAATCAACA GTTTATTAGT TTTACTATTT ATCATTGGCT TAGCCTTAAT TTTTAACAAT 
CAGATACGTA GTTGGGTGGT TCAACAAAAT AGCCGCTCGT ACGCCGTTAG CAAGTTGAAA 
CCAGCTGATG TGAAGAAAAA TATGGCTCGT GAAACAACGT TTGACTTTGA TTCAGTTGAG 
TCCTTGAGCA CAGAAGCGGT GATGAAAGCC CAATTTGAAA ACAAAAACTT ACCTGTGATT 
GGTGCCATTG CGATACCAAG TGTCGAAATT AATTTGCCCA TTTTTAAAGG ATTGTCCAAT 
GTCGCTTTAT TAACTGGTGC CGGGACCATG AAAGAAGATC AAGTCATGGG GAAAAACAAT 
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TATGCCTTGG CTAGTCATCG AACGGAAGAT GGCGTTTCCT TATTTTCACC TTTAGAAAGA 
ACCAAAAAAG ACGAACTCAT TTATATCACT GATTTATCTA CTGTTTATAC ATACAAAATA 
ACTTCTGTAG AAAAAATCGA ACCAACCCGT GTTGAGTTAA TTGATGACGT TCCTGGTCAA 
AATATGATTA CCTTAATTAC CTGTGGCGAT TTACAAGCAA CGACGCGAAT TGCTGTTCAA 
GGAACATTAG CAGCAACGAC GCCTATTAAA GACGCCAACG ACGATATGTT GAAGGCTTTC 
CAATTGGAGC AAAAAACTTT AGCCGATTGG GTGGCTTAA ^ 

EF130-2 (SEQ ID NO: 486) 

YIKRRENMRP KEKKRGKNWL INSLLVLLFI IGLALIFNNQ IRSWWQQNS RSYAVSKLKP 
ADVKKNMARE TTFDFDSVES LSTEAVMKAQ FENKNLPVIG AIAIPSVEIN LPIFKGLSNV 
ALLTGAGTMK EDQVMGKNNY ALASHRTEDG VSLFSPLERT KKDELIYITD LSTVYTYKIT 
SVEKIEPTRV ELIDDVPGQN MITLITCGDL QATTRIAVQG TLAATTPIKD ANDDMLKAFQ 
LEQKTLADWV A 

EF130-3 (SEQ ID NO:487) 

CGTTAG CAAGTTGAAA 

CCAGCTGATG TGAAGAAAAA TATGGCTCGT GAAACAACGT TTGACTTTGA TTCAGTTGAG 
TCCTTGAGCA CAGAAGCGGT GATGAAAGCC CAATTTGAAA ACAAAAACTT ACCTGTGATT 
GGTGCCATTG CGATACCAAG TGTCGAAATT AATTTGCCCA TTTTTAAAGG ATTGTCCAAT 
GTCGCTTTAT TAACTGGTGC CGGGACCATG AAAGAAGATC AAGTCATGGG GAAAAACAAT 
TATGCCTTGG CTAGTCATCG AACGGAAGAT GGCGTTTCCT TATTTTCACC TTTAGAAAGA 
ACCAAAAAAG ACGAACTCAT TTATATCACT GATTTATCTA CTGTTTATAC ATACAAAATA 
ACTTCTGTAG AAAAAATCGA ACCAACCCGT GTTGAGTTAA TTGATGACGT TCCTGGTCAA 
AATATGATTA CCTTAATTAC CTGTGGCGAT TTACAAGCAA CGACGCGAAT TGCTGTTCAA 
GGAACATTAG CAGCAACGAC GCCTATTAAA GACGCCAACG ACGATATGTT GAAGGCTTTC 
CAATTGGAGC AAAAAACTTT AGCCGATTGG GTGGCT 



EF130-4 {SEQ ID NO:488) 
VSKLKP 

ADVKKNMARE TTFDFDSVES LSTEAVMKAQ 
ALLTGAGTMK EDQVMGKNNY ALASHRTEDG 
SVEKIEPTRV ELIDDVPGQN MITLITCGDL 
LEQKTLADWV A 



FENKNLPVIG AIAIPSVEIN LPIFKGLSNV 
VSLFSPLERT KKDELIYITD LSTVYTYKIT 
QATTRIAVQG TLAATTPIKD ANDDMLKAFQ 



EF131-1 (SEQ ID NO:489) 

TAGGCGGAGG TAAGCGGTAT GCGTAAACGA 
TGGCTTTTTA TAGTATGTTT GTTGGTGGTG 
TTCTTTTTCA CTAGAGATTC ACAAGTTAGT 
CGCCGAAGTG ATAATTATGC GAATTTAACG 
CTTGATCAAA AAATTCAAGA AACAAATTAT 
CAGGTTTTAG TAAATAAAGG ATATGGCTTT 
CCAAACACAA GGTTTCAGAT TGGCTCAATT 
AAAGCAATTG AAGAAGGTAA ACTTACATTA 
ATTCAAGGTG CTGAGGATAT TACGATTAGC 
TTATCAGCAA TGCCTAATAA TATCGTTACC 
AATACCATTC AAGTCAATAA AGGAAAATAC 
GCAGGAATGT TAGAGAAAAT GTATCAACGT 
CACAAAACGG CTGGTTTAAA GAATTTTGGC 
AATTCAACAA GTTATAAATG GACAGAAGAT 



CATGCAAAGA' AAAGACATGG AGGAGTGAAT 
ATTGGTGGTA GTGGTTATTT AATAAAAACG 
CAAGAATCGA AAGTGGTCTT GGAAGAAGAT 
AAAGAAATAG TTGCACCAGA TAGTGGCGAA 
ATTGGTTCGG CTTTGATCAT TAAAGATGAT 
GCCAATTTTG AAAAGCAACA AGCCAACACG 
CAAAAATCTT TTACCACAAC CTTGATCTTA 
GATACAAAAC TCGCTACGTT TTATCCGCAA 
GATATGTTGA ATATGACAAG TGGTTTAAAG 
GATGAAGAAA TTATTCAATT TGTTAAACAA 
AATTATTCCC CAGTAAATTT TGTCCTTTTA 
ACCTATCAAG AATTATTTAA TAATCTTTAT 
TTCTATGAAA CCTTATTGGA ACAGCCCAAT 
AATTCATATA ACCAAGTGCT CTCAATTCCT 
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GCAGCTAGTT TTGCCCATGA ATTTGGGACT GGTAATGTGG ATATGACGAC AGGTGATTTG 
TATTGGTACT TACATCAATT AACGAGTGGA CATTTAGTTT CCACCGCACT TTTGCAAAAA 
TTATGGACGT CTTCTCAGCA AAGCTCTTAT CATGGCGGCA TCTATGTTCA TGATAATTAT 
TTACGTTTAC ACGGCGTTGA AGCGGGTCAA CAAGCCCTGG TTTTATTTTC AAAAGATATG 
AAGACAGGGG TCATATTGCT AACTAACTGT GTGAATCCAG CGAAATACAA AGAATTAATT 
GGTTCGTTGT TCCATGATGT AACCAATTTA ACTGTTAAAT TTTAA 

EF131-2 (SEQ ID NO:490) 

MRKRH AKKRHGGVNW LFIVCLLWI GGSGYLIKTF FFTRDSQVSQ ESKWLEEDR 
RSDNYANLTK EIVAPDSGEL DQKIQETNYI GSALIIKDDQ VLVNKGYGFA NFEKQQANTP 
NTRFQIGSIQ KSFTTTLILK AIEEGKLTLD TKLATFYPQI QGAEDITISD MLNMTSGIiKL 
SAMPNNIVTD EEIIQFVKQN TIQVNKGKYN YSPVNFVLLA GMLEKMYQRT YQELFNNLYH 
KTAGLKNFGF YETLLEQPNN STSYKWTEDN SYNQVLSIPA ASFAHEFGTG NVDMTTGDLY 
WYLHQLTSGH LVSTALLQKL WTSSQQSSYH GGIYVHDNYL RLHGVEAGQQ ALVLFSKDMK 
TGVILLTNCV NPAKYKELIG SLFHDVTNLT VKF 

EF131-3 {SEQ ID NO:491) 

TTT AATAAAAACG 

TTCTTTTTCA CTAGAGATTC ACAAGTTAGT CAAGAATCGA AAGTGGTCTT GGAAGAAGAT 
CGCCGAAGTG ATAATTATGC GAATTTAACG AAAGAAATAG TTGCACCAGA TAGTGGCGAA 
CTTGATCAAA AAATTCAAGA AACAAATTAT ATTGGTTCGG CTTTGATCAT TAAAGATGAT 
CAGGTTTTAG TAAATAAAGG ATATGGCTTT GCCAATTTTG AAAAGCAACA AGCCAACACG 
CCAAACACAA GGTTTCAGAT TGGCTCAATT CAAAAATCTT TTACCACAAC CTTGATCTTA 
AAAGCAATTG AAGAAGGTAA ACTTACATTA GATACAAAAC TCGCTACGTT TTATCCGCAA 
ATTCAAGGTG CTGAGGATAT TACGATTAGC GATATGTTGA ATATGACAAG TGGTTTAAAG 
TTATCAGCAA TGCCTAATAA TATCGTTACC GATGAAGAAA TTATTCAATT TGTTAAACAA 
AATACCATTC AAGTCAATAA AGGAAAATAC AATTATTCCC CAGTAAATTT TGTCCTTTTA 
GCAGGAATGT TAGAGAAAAT GTATCAACGT ACCTATCAAG AATTATTTAA TAATCTTTAT 
CACAAAACGG CTGGTTTAAA GAATTTTGGC TTCTATGAAA CCTTATTGGA ACAGCCCAAT 
AATTCAACAA GTTATAAATG GACAGAAGAT AATTCATATA ACCAAGTGCT CTCAATTCCT 
GCAGCTAGTT TTGCCCATGA ATTTGGGACT GGTAATGTGG ATATGACGAC AGGTGATTTG 
TATTGGTACT TACATCAATT AACGAGTGGA CATTTAGTTT CCACCGCACT TTTGCAAAAA 
TTATGGACGT CTTCTCAGCA AAGCTCTTAT CATGGCGGCA TCTATGTTCA TGATAATTAT 
TTACGTTTAC ACGGCGTTGA AGCGGGTCAA CAAGCCCTGG TTTTATTTTC AAAAGATATG 
AAGACAGGGG TCATATTGCT AACTAACTGT GTGAATCCAG CGAAATACAA AGAATTAATT 
GGTTCGTTGT TCCATGATGT AACCAATTTA ACTGTTAAAT TT 

EF131-4 (SEQ ID NO:492) 

LIKTF FFTRDSQVSQ ESKWLEEDR 

RSDNYANLTK EIVAPDSGEL DQKIQETNYI GSALIIKDDQ VLVNKGYGFA NFEKQQANTP 
NTRFQIGSIQ KSFTTTLILK AIEEGKLTLD TKLATFYPQI QGAEDITISD MLNMTSGLKL 
SAMPNNIVTD EEIIQFVKQN TIQVNKGKYN YSPVNFVLLA GMLEKMYQRT YQELFNNLYH 
KTAGLKNFGF YETLLEQPNN STSYKWTEDN SYNQVLSIPA ASFAHEFGTG NVDMTTGDLY 
WYLHQLTSGH LVSTALLQKL WTSSQQSSYH GGIYVHDNYL RLHGVEAGQQ ALVLFSKDMK 
TGVILLTNCV NPAKYKELIG SLFHDVTNLT VKF 



EF132-1 (SEQ ID NO:493) 

TAGTTTTCTAATCTCACCAAAACAAAAATTTTTAAGAAAGAAGGAGAGATCGTTATGATGAGAAAATG^ 
GTGGGAAGTCTGGGAATGTTGATTGCTCTTTTTATATTCGGGGCATGTTCAACAAATAGTAAAGACAAAGATACAG 
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GCTTCGAACGAAAAATTAAAGGTAGTAGTTACTAATTCGATTTTAGCAGATATTACTG 

ATTGATTTACACAGTATCGTACCTATTGGGAAAGATCCCCACGAATATGAACCtTTGCCTGAAGATGT^ 

TCAAAAGCAGATTTGATTTTTTATAACGGTGTTAACTTGGAmACTGGAGGAAATC 

mATGCGAACAAAGAGGAAAACAAAGACTATTTTGCAGCAAGTCATGGCATAGATGTTATOT 

GAGAAAGGGAAGGAAGATCCCCATGCTTGGTTAAATTTAGAAAACGGTATTATTTACG 

TTAGCGGAAAAAGATCCTGATAATAAAAAATTCTATAAAGAAAATCTAGATAAGTATATTGAAAAGTT^ 

GACAAAGAAGCTAAATCTAAATTTGCTTCAATTCCGAATGATAAAAAAATGATTGTTACAAGTGAAGG 

TATTTCTCGAAAGCGTATAATGTGCCTTCTGCTTACATTTGGGAAAtCAACACTGAAGAAGAAGGAAC^ 

ATAAAACACTTAGTTGAAAAATTACGCACAACAAAAGTTCCCTCCTTATTCGTAGAAAGTAGTGTGGACGATAGACCG 

ATGAAAACAGTATCAAAAGATACCAATATTCCTATCTATTCAACGATTTTTACTCATTCAATTGCAGA 

GATGGTGATAGTTACTATGCGATGATGAAATGGAACCTGGATAAAATTGCTGAAGGCCTT^ 

EF132-2 (SEQ ID NO:494) 

MMRKWKVWGSLGMLIALFIFGACSTNSKDKDTVASNEKLKVVVTNSIL^ 

LPEDVQKTSKADLIFYNGVNLXTGGNAWFTKLVKXANKEEl^DYFAASDGIDVIYLEGQSEKGKEDPHAWLNLE^^ 
YAKNI EKWLAEKDPDNKKFYKENLDKY I EKLDSLDKEAKSKFAS I PNDKKMI VTSEGCFKYFSKAYNVPSAY I WE INT 
EEEGTPDQIKHLVEKLRTTKVPSLFVESSVDDRPMKWSKDTNIPIYSTIFTDSIAEKGQDGDSYYAMblKWNLD 
GLSK. 

EF132-3 (SEQ ID NO:495) 

ATGTTCAACAAATAGTAAAGACAAAGATACAGTCGCTTCGAACGAAAAATTAAAGGTAGTAGTTACTAAT^ 

AGCAGATAITACTGAAAATATAGCAAAAGATAAAATTGATTTACACAGTATCGTACCTATT^ 

ATATGAACCtTTGCCTGAAGATGTTCAAAAAACTTCAAAAGCAGATTTGATTTT^ 

TGGAGGAAATGCTTGGTTTACAAAATTAGTAAAAmATGCGAACAAAGAGGAAAACAAAGACTATT^^ 

TGGCATAGATGTTATTTACTTAGAGGGTCAGAGTGAGAAAGGGAAGGAAGATCCCCATGCTTGGT^ 

CGGTATTATTTACGCTAAAAATATTGAAAAATGGTTAGCGGAAAAAGATCCTGATAATAAAAAATTCTATA 

TCTAGATAAGTATATTGAAAAGTTGGATTCTCTAGACAAAGAAGCTAAATCTAAATTTGCTTC^ 

AAAAATGATTGTTACAAGTCAAGGATGCTTtAAATATTTCTCGAAAGCGTATAATGTO 

AAtCAACACTGAAGAAGAAGGAACACCAGATCAAATAAAACACTTAGTTGAAAAATTACGCACAACAAAAGTTCCCTC 
CTTATTCGTAGAAAGTAGTGTGGACGATAGACCGATGAAAACAGTATCAAAAGATACCAATATTCCTATCTAT^ 
GATTTTTACTGATTCAATTGCAGAAAAAGGACAAGATGGTGATAGTTACTATGCGATGATGAAATGGAACCTC 
AATTGCTGAAGGCCTTTCGAAA 



EF132-4 (SEQ ID NO:496) 

CSTNSKDKDTVASNEKLKWVTNSILADITENIAKDKIDLHSIVPIGKDPHEYEPLPEDVQKTSKADLIFYNGVNLXT 
GGNAWFTKLVKXANKEENKDYFAASDGIDVIYLEGQSEKGKEDPHAWLNLENGIIYAKNIEKWI^ 
LDKYIEKLDSLDKEAKSKFASIPNDKKMIVTSEGCFKYFSKAYIWPSAYIWEINTEEEGTPDQIKHLVEKLRTTKVPS 
LFVESSVDDRPMKTVSKDTNIPIYSTIFTDSIAEKGQDGDSYYAMMKWNLDKIAEGLSK 
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ink and Derwent databases, 


;s in GenBE 


4.20E-36 


4.00E-34 


4.10E-74 1 


2.20E-67 1 


2.50E-67 


9.80E-54 


6.70E.40 


3.10E-28 


2.40E-25 


8.60E-14 


8.30E-18 1 


7.80E-13 1 


1.70E-11 


1.70E-11 


1.70E-11 


1.70E-11 


1.70E-11 


2.70E-11 


4.70E-16 


6.00E-15 


6.20E-15 


6.20E-I5 


Table 2. Closest matching sequences between the polypeptides of the present invention and sequence 
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S.thermophilm exopolysaccharide biosynthesis protein EpsR. 


S.thermophilus exopolysaccharide synthesis operon epsA gene 
product. 


Helicobacter-specific ATPase 439. 


Mycobacterium BCG immunogen. 


Helicobacter-specific ATPase 948 (ORF-4). 


Rat homologue of human Wilson disease gene ATP7B. 


Wilson disease protein ATP7B. 


Product of the sscl gene. 


Flea sodium pump alpha subunit. 


H. pylori transporter protein, 14ce20219orfl. 


Bacteroides fragilis RprX regulatory response protein. 


Tomato TGETRl ethylene response protein. 


Ethylene response (ETR) gene product. 


Ethylene response (ETR) mutant protein etrl-1. 


Ethylene response (ETR) mutant protein etrl-2. 


Ethylene response (ETR) mutant protein etrl-3. 


Ethylene response (ETR) mutant protein etrl-4. 


Regulatory protein VanS involved in glycopeptide resistance. 


Penicillin binding protein PBP2A-epi. 


Penicillin binding protein PBP2A-27R. 


Penicillin binding protein derivative #1 . 


Penicillin binding protein derivative #2. 


W14070 


W22169 


R97280 


R48036 


W06712 


R70419 


R72343 


R06376 


R75396 


W20891 


R56667 


R74630 


R69849 


R69850 


R69851 


R69852 


R69853 


R24296 


R27253 


R27256 


R27257 


R27258 


|EF075.2 


EF075-2 


IEF077-2 


|EF077-2 


IEF077-2 


EF077-2 


EF077-2 


|EF077-2 


|EF077-2 


|EF077-2 


IEF078-2 


IEF078-2 


IEF078-2 


|EF078-2 


IEF078-2 


IEF078-2 


IEF078-2 


IEF078-2 
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TABLE 3. Conservative Amino Acid Substitutions. 



PCTAJS98/08959 



Aromatic 


Phenylalanine 




Tryptophan 




Tyrosine 


Hydrophobic 


Leucine 




Isoleucine 




Valine 


Polar 


Glutamine 




Acnarapine 


Basic 


Arginine 




Lysine 




Histidine 


Acidic 


Aspartic Acid 




Glutamic Acid 


Small 


Alanine 




Serine 




Threonine 




Methionine 




Glycine 
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Table 4. Residues Comprising Antigenic Epitope-Bearing Portion. 



EFOOl-2 


from about Asp- 150 to about Lys-152, from about Ser-256 to about Tyr- 
259, from about Lys-360 to about Lys-363, from about Asn-406 to about 

A «— A AO 

Asp-408. 






EF002-2 


from about Asp-80 to about Asp-83, from about Asp-281 to about Gly- 
2o3. 






brUOi-2 


irom about Asn-263 to about Gly-266. 






EF004-2 


from about Asn-23 to about Asn-26, from about Lys-83 to about Ser-87, 
from about Tyr-154 to about Asp-159. 






EF005-2 


from about Lys-249 to about Glu-252. 






EF006-2 


from about GIy-23 to about Asp-28. 






EF008-2 


from about Thr-92 to about Gly-94, from about Pro-161 to about Asp- 
165, fix)m about Gly-287 to about Thr-289. 






EFOlO-2 


from about Pro- 129 to about Asri-131. 






EF012-2 


from about Asp-77 to about Asp-79, from about Asp-94 to about Lys-98, 
from about Asp-256 to about Thr-258, from about Glu-461 to about Asn- 
468. 






EF013-2 


from about Thr-30 to about Asp-32, from about Glu-73 to about Ala-75, 
from about Gin- 164 to about Asn-166, from about Lys-193 to about Gly- 
195. 






hrU14-i 


irom about Ser-203 to about Asp-206, from about Gln-314 to about Gly- 
316 






EF015-2 


from about Pro-66 to about Gly-69. 






EF016-2 


from about Lys-236 to about Asn-239. 






EF017-2 


from about Ser-90 to about Gly-93, from about Thr-197 to about Lys- 
199, from about Lys-230 to about Asn-233, from about Ser-428 to about 
Gly.431. 






EF018-2 


from about Lys-159 to about Tyr-161, from about Asn-165 to about Ser- 
167, from about Asn-250 to about Arg-256, from about Asn-392 to about 
Gly-395, from about Lys-416 to about Tyr-418, from about Asn-428 to 
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Table 4. Residues Comprising Antigenic Epitope-Bearing Portion. 





about Arg-430. 






EF019-2 


from about Arg-209 to about Ser-21 1, from about Lys-287 to about Ser- 
290. 






EF020-2 


from about Lys-57 to about Asn-62. 






EF021-2 


from about Ser-33 to about Gly-35, from about Glu-77 to about Gly-81, 
from about Asp-139 to about Lys-141, from about Glu-255 to about Ser- 
258, from about Gln-271 to about Tyr-277. 






EF023-2 


from about Lys-232 to about Asp-234, from about Arg-304 to about Gly- 
306, from about Thr-453 to about Arg-456, from about Ser-478 to about 
Thr-480. 






EF025-2 


from about Arg- 1 83 to about Asp- 185. 






EF026-2 


from about Ser-25 to about Asp-30, from about Asp-90 to about Asp-94, 
from about Gin- 107 to about Asn-1 10. 






EF027-2 


from about Gln-72 to about Lys-74, from about Lys-229 to about Asp- 
231. 






EF028-2 


from about Asp-186 to about Gln-188. 






EF029-2 


from about Asp-1 18 to about Lys-122. from about Asp-124 to about 
Tyr-126. 






EF031-2 


from about Glu-30 to about Gly-33. 






EF034-2 


from about Glu-25 to about Gly-27, from about Glu-75 to about Thr-77. 










EF36-2 


from about Gin- 1 77 to about Ser- 179. 






EF037-2 


from about Ser-25 to about Asp-30, from about Asp-90 to about Asp-94, 
from about Gin- 107 to about Asn-1 10. 






EF038-2 


from about Asn-77 to about Lys-79, from about Tyr-88 to about Asn-92. 






EF040-2 


from about Lys-167 to about Gly-172, from about Lys-240 to about 
Asn-242. 
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EFp44-2 


from about Arg-192 to about Gly-194, from about Asn-200 to about. Asn- 
203. 






EF045-2 


from about Asp- 159 to about Asn-161, from about His- 172 to about Gly- 
174, from about Tyr-261 to about Gly-264, from about Lys-305 to about 
Glu-308. 






EF046-2 


from- about Ser-18 to about Gly-23, from about Gln-41 to about Ser-47, 
from about Thr-76 to about Asp-78. 






EF047-2 


from about Asn-28 to about Asp-30, from about Asp-273 to about Asn- 
277. 






EF048-2 


from about Asp-138 to about Lys-141, from about Asp-152 to about 
Gly-154. 






EF051-2 


from about Asp-73 to about Gly-76. 






EF053-2 


from about Ser-79 to about Gly-82. 






EF055-2 


from about Asp-26 to about Gly-28, from about Gln-67 to about Asp-69, 
from about Arg-71 to about Gly-74, from about Arg-87 to about Gly-89. 






EF056-2 


from about Arg-71 to about Gly-74, from about Arg-87 to about Gly-89. 










EF058-2 


from about Lys-129 to about Gly-133, from about Gln-571 to about Tyr- 
573, from about Pro-586 to about Gly-591. 






EF065-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF066-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-5] 8, from about Glu-639 to about Lys-642. 






EF067-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491 , from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 
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EF073-2 


from about Met-98 to about Arg-lOO, from about Arg-l 10 to about Asp- 
112. 






EF074-2 


from about Ser-53 to about Tyr-59, from about Ser-86 to about Gly-88, 
from about Pro-97 to about Gin- 100, from about Gln-230 to about Gly- 
232. 






EF076-2 


from about Asn-38 to about Tyr-40, from about Asp-48 to about Asn-53, 
from about Lys-79 to about Gly-8 1 . 






EF077-2 


from about Arg-41 1 to about Gly-413. 






EF078-2 


from about Thr-294 to about Gly-296, from about Asp-366 to about Gln- 
368, from about Glu-524 to about Gly-526. 






EF080-2 


from about Glu-164 to about Gly- 166, from about Ser-206 to about Tyr- 
208, from about Lys-239 to about Gly-243. 






EF081-2 


from about Asn-7 to about Ser-ll, from about Lys-77 to about Tyr-80, 
from about Lys-1 12 to about Asn-1 14, from about Gly-162 to about Asp- 
164, from about Arg-l 81 to about Gly-183. 






EF083-2 


from about Gln-38 to about Arg-40. 






EF084-2 


from about Lys-1 40 to about Asp- 142, from about Gly- 164 to about Arg- 
166, from about Arg-262 to about Gly-264. 






EF085-2 


from about Asn-95 to about Asp-97, from about Arg-l 12 to about Asp- 
1 14, from about Asp-258 to about Ser-260, from about Arg-401 to about 
Ser-403. 






EF086-2 


from about Pro-1 12 to about Gly-1 15, from about Ser-222 to about Ser- 
224, from about Asn-296 to about Gly-299, from about Thr-346 to about 
Lys-348, from about Asp-428 to about Ser-432. 






EF087-2 


from about Pro-1 12 to about Gly-1 15, from about Ser-222 to about Ser- 
224, from about Asn-296 to about Gly-299, from about Thr-346 to about 
Lys-348, from about Asp-428 to about Ser-432. 






EF088-2 


from about Pro-1 12 to about GIy-1 15, from about Ser-222 to about Ser- 
224, from about Asn-296 to about Gly-299, from about Thr-346 to about 
Lys-348, fix)m about Asp-428 to about Ser-432. 
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EF090-2 


from about Arg-2 to about Arg-D. 






EF091-2 


from about Gln-40 to about Asp-43. 






EF093-2 


from about Lys-95 to about Gly-97. 






EF094-2 


from about Asp-314 to about Asp-316. 






EF095-2 


from about Ser-328 to about Thr-330, from about Asp-359 to about Asp- 
363, from about Glu-637 to about Gly-639, from about Asn-744 to about 
Gly-746. 






EF096-2 


from about Pro-128 to about Asn-130, from about Ser-193 to about Asp- 
196. 






EF097-2 


from about Val-357 to about Gly-359. 






EF099-2 


from about Glu-44 to about Asp-47, from about Lys-154 to about Gly- 
156, from about Asn-286 to about Asp-289. 






EFlOl-2 


from about Lys-40 to about Asp-42, from about Pro-255 to about Asn- 
258, from about Lys-288 to about Gly-290. 






EF102-2 


from about Asp-314 to about Asp-316. 






EF 103-2 


from about Asn-46 to about Gly-48. 






EF 104-2 


from about Pro-232 to about Lys-237, from about Ala-362 to about Asn- 
366, from about Ser-421 to about Gly-423, from about Lys-488 to about 
Ser-490, from about Asp-550 to about Asn-552, from about Pro-637 to 
about Lys-640, from about Asp-727 to about Gly-729, from about Asn- 
751 to about Ser-754, from about Lys-771 to about Asn-774, from about 
Ile-835 to about Asn-837, from about Pro-851 to about Glyr853, 






EF 105-2 


from about Ser-40 to about Gly-43, from about Asn-94 to about Gln-97, 
from about Gln-220 to about Uiy-z2z, irom aoout Asn-zoJ to aoout uiy- 
265. 






EF106-2 


from about Asp-72 to about Gly-75, from about Thr-274 to about Asp- 
277, from about Asn-310 to about Arg-313. 






EF107-2 


from about Thr-155 to about Asn-157, from about Thr-189 to about Asp- 
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191, from about Arg-270 to about Gly-272, from about Thr-330 to about 
Lys-335, from about Asp-365 to about Asp-368, from about Pro-451 to 
about Asp-453, from about Gly-485 to about Thr-488. 






EF108-2 


from about Lys-142 to about Trp-145, from about Thr-147 to about Tyr- 
150, from about Arg-212 to about Gly-214, from about Ser-248 to about 
Asp-251, from about Asp-384 to about Asp-387, from about Pro-481 to 
about Arg-483, from about Lys-491 to about Gly-494, from about Thr- 
619 to about Gly-624, from about Asp-656 to about Asp-659, from about 
Lys-717 to about Asn-721, from about Ser-822 to about Gly-824, fit)m 
about Tyr-1 137 to about Thr-l 141. 






EFllO-2 


from about Pro- 123 to about Gly-127, from about Thr-223 to about Gly- 
225. 






EFlll-2 


from about Lys-207 to about Asn-209, from about Asp-245 to about 
Asn-248, from about Lys-396 to about Asp-398, from about Glu-429 to 
about Ser-432, from about Thr-470 to about His-474. 






EF119-2 


from about Asp-90 to about Asn-92, from about Gin- 142 to about Gly- 
144. 






EF121-2 


from about Asn-159 to about Asp- 161, from about Asn-351 to about 
Lys-353, from about Pro-658 to about Gly-660, from about Lys-786 to 
about Ser-789. 






EF122-2 


from about Asn-159 to about Asp-161, from about Asn-351 to about 
Lys-353, from about Pro-658 to about Gly-660, from about Lys-786 to 
about Ser-789. 






EF123-2 


from about Asn-331 to about Arg-336, from about Asp-634 to about Gly- 
636, from about Glu-780 to about Ser-782, from about Tyr-909 to about 
Asn-91 1. ftx)m about Lys-939 to about Glu-942, from about Asp-1074 to 
about Gly-1076, from about Asp-1367 to about Gly-1369, from about 
Pro-1433 to about Lys-1435, from about Gly-1516 to about Asp-1518, 
from about Lys-1656 to about Asp-1660, from about Lys-1860 to about 
Gln.1863, from about Ser-1916 to about Gln-1919, from about Pro-1940 
to about Gly-1942. 






EF124.2 


from about Asn-331 to about Arg-336, from about Asp-634 to about Gly- 
636, from about Glu-780 to about Ser-782, from about Tyr-909 to about 
Asn-91 1, from about Lys-939 to about Glu-942, from about Asp-1074 to 
about Gly-1076, from about Asp-1367 to about Gly-1369, from about 
Pro-1433 to about Lys-1435, from about Gly-1516 to about Asp-1518, 
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from about Lys-1656 to about Asp-1660, from about Lys-1860 to about 
Gln-1863, from about Ser-1916 to about Gln-1919, from about Pro-1940 
to about Gly-1942. 






EF125-2 


from about Asn-33 1 to about Arg-336, from about Asp-634 to about Gly- 
636, from about Glu-780 to about Ser-782, from about Tyr-909 to about 
Asn-91 1, from about Lys-939 to about Glu-942, from about Asp-1074 to 
about Gly-1076, from about Asp-1367 to about Gly-1369, from about 
Pro- 1433 to about Lys-1435, from about Gly-1516 to about Asp-1518, 
from about Lys-1656 to about Asp- 1660, from about Lys-1860 to about 
Gln-1863, from about Ser-1916 to about Gln-1919, from about Pro-1940 
to about Gly-1942. 






EF126-2 


from about Ser-236 to about T)n:-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF127-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF128-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF129-2 


from about Asn-300 to about Gly-302, from about Ser-316 to about Gly- 
319, from about Asn-385 to about His-387 






EF131-2 


from about Lys-201 to about Tyr-204, from about Glu-263 to about Ser- 
266. 






EF132-2 


from about Thr-26 to about Ser-28. 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRuIe Ubis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 10 , line 12 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet Q 



Nan * of depositary institution American Type Culture Collection 



Address of depositary institution {including postal code and country) 

10801 University Boulevard 
Manasas. Virginia 201 10-2209 
United States of America 



Date of deposit May 2, 1997 



Accession Number 55969 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the mdicoiiom are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS Oeave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specify the general nature of the indications, e.g., "Accession 
Number of Deposit'*) 



For receiving Office use only , 



Authorized 



This sheet urns received with the international application 
d officer >S 



• For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 
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What Is Claimed Is: 

1. An isolated nucleic acid molecule comprising a polynucleotide having a nucleotide 
sequence selected from the group consisting of: 

(a) a nucleotide sequence encoding any one of the amino acid sequences of the 
polypeptides shown in Table 1; or 

(b) a nucleotide sequence complementary to any one of the nucleotide sequences in (a). 

(c) a nucleotide sequence at least 95% identical to any one of the nucleotide sequences 
shown in Table 1 ; or, 

(d) a nucleotide sequence at least 95% identical to a nucleotide sequence complementary 
to any one of the nucleotide sequences shown in Table 1. 

2. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which 
hybridizes under stringent hybridization conditions to a polynucleotide having a 
nucleotide sequence identical to a nucleotide sequence in (a) or (b) of claim 1. 

3. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which 
encodes an epitope-bearing portion of a polypeptide in (a) of claim 1. 

4. The isolated nucleic acid molecule of claim 3, wherein said epitope-bearing portion 
of a polypeptide comprises an amino acid sequence listed in Table 4. 

5. A method for making a recombinant vector comprising inserting an isolated nucleic 
acid molecule of claim 1 into a vector. 

6. A recombinant vector produced by the method of claim 5. 

7. A host cell comprising the vector of claim 6. 

8. A method of producing a polypeptide comprising: 

(a) growing ttie host cell of claim 7 such that the protein is expressed by the cell; and 

(b) recovering the expressed polypeptide. 

9. An isolated polypeptide comprising a polypeptide selected from the group 
consisting of: 

(a) a polypeptide consisting of one of the complete amino acid sequences of Table 1; 

(b) a polypeptide consisting of one the complete amino acid sequences of Table 1 except 
the N-terminal residue; 
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(c) a fragment of the polypeptide of (a) having biological activity; and 

(d) a fragment of the polypeptide of (a) which binds to an antibody specific for the 
polypeptide of (a). 

10. An isolated antibody specific for the polypeptide of claim 9. 

1 1. A polypeptide produced according to the method of claim 8. 

12. An isolated polypeptide comprising an amino acid sequence at least 95% identical to 
a sequence selected from the group consisting of an amino acid sequence of any one of 
the polypeptides in Table 1. 

13. An isolated polypeptide antigen comprising an amino acid sequence of an £. 
faecalis epitope shown in Table 4. 

14. An isolated nucleic acid molecule comprising a polynucleotide with a nucleotide 
sequence encoding a polypeptide of claim 9. 

15. A hybridoma which produces an antibody of claim 10. 

16. A vaccine, comprising: 

(1) one or more E. faecalis polypeptides selected from the group consisting of a 
polypeptide of claim 9; and 

(2) a pharmaceutically acceptable diluent, carrier, or excipient; 

wherein said polypeptide is present, in an amount effective to elicit protective 
antibodies in an animal to a member of the Enterococcus genus. 

17. A method of preventing or attenuating an infection caused by a member of the 
Enterococcus genus in an animal, comprising administering to said animal a polypeptide 
of claim 9, wherein said polypeptide is administered in an amount effective to prevent 
or attenuate said infection. 

18. A method of detecting Enterococcus nucleic acids in a biological sample 
comprising: 

(a) contacting the sample with one or more nucleic acids of claim 1, under conditions 
such that hybridization occurs, and 

(b) detecting hybridization of said nucleic acids to the one or more Enterococcus 
nucleic acid sequences present in the biological sample. 
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19. A method of detecting Enterococcus nucleic acids in a biological sample obtained 
from an animal, comprising: 

(a) amplifying one or more Enterococcus nucleic acid sequences in said sample using 
polymerase chain reaction, and 

(b) detecting said amplified Enterococcus nucleic acid. 

20. A kit for detecting Enterococcus antibodies in a biological sample obtained from an 
animal, comprising 

(a) a polypeptide of claim 9 attached to a solid support; and 

(b) detecting means. 

21. A method of detecting Enterococcus antibodies in a biological sample obtained 
from an animal, comprising 

(a) contacting the sample with a polypeptide of claim 9; and 

(b) detecting antibody-antigen complexes. 



